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METHODS OF SCREENING FOR COMPOUNDS THAT MODULATE THE LSR - 
LEPTIN INTERACTION AND THEIR USE IN THE PREVENTION AND TREATMENT 

OF OBESITY-RELATED DISEASES 

5 FIELD OF THE INVENTION 

The present invention relates to the field of obesity research, in particular methods of 
screening for new compounds for the treatment of obesity and obesity-related diseases and 
disorders, as well as methods of treating obesity-related diseases and disorders. To this end, the 
characterization of the interaction between a new complex receptor polypeptide, LSR (Lipolysis 
1 0 Stimulated Receptor), and one of its ligands, leptin, is described. The obesity-related diseases or 
disorders envisaged to be treated by the methods of the invention include, but are not limited to, 
anorexia, hyperlipidemias, atherosclerosis, diabetes, hypertension and syndrome X. In addition, 
and more generally, the various pathologies associated with abnormalities in the metabolism of 
cytokines, may be treated by the methods of the invention. 

15 

BACKGROUND OF THE INVENTION 
The following discussion is intended to facilitate the understanding of the invention, but 
is not intended nor admitted to be prior art to the invention. 

Obesity is a public health problem that is serious, widespread, and increasing. In the 

20 United States, 20 percent of the population is obese; in Europe, a slightly lower percentage is 
obese (Friedman (2000) Nature 404:632-634). Obesity is associated with increased risk of 
hypertension, cardiovascular disease, diabetes, and cancer as well as respiratory complications 
and osteoarthritis (Kopelman (2000) Nature 404:635-643). Even modest weight loss ameliorates 
these associated conditions. 

25 While still acknowledging that lifestyle factors including environment, diet, age and 

exercise play a role in obesity, twin studies, analyses of familial aggregation, and adoption studies 
all indicate that obesity is largely the result of genetic factors (Barsh et al (2000) Nature 404:644- 
65 1). In agreement with these studies, is the fact that an increasing number of obesity-related genes 
are being identified. Some of the more extensively studied genes include those encoding leptin (ob) 

30 and its receptor (db\ proopiomelanocortin {Pome), melanocortin-4-receptor (Mc4r\ agouti protein 
(A*), carboxypeptidase E {fat), 5-hydroxytryptamine receptor 2C (Htr2c), nescient basic helix-loop- 
helix 2 (Nhlh2) 9 prohormone convertase 1 IPCSKty, and tubby protein (tubby) (rev'd in Barsh et al 
(2000) Nature 404:644-65 1 ). 

The gene encoding leptin, one of the most widely studied obesity genes, is involved in the 

35 mechanisms of satiety (rev'd in Schwartz et al (2000) Nature 404 :661-671). Leptin is a plasma 

protein of 16 kDa produced by adipocytes (Zhang et al ((1994) Nature 372:425-432). Mice with an 
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autosomal recessive mutation in this gene (ob/ob mice) are obese and hyperphagic. Similarly, mice 
with an autosomal recessive mutation of the leptin receptor (db/db mice, for example) are also 
obese (Campfield et al (1995) Science 269:546-549). Administration of leptin to ob/ob, but not 
db/db \ mice comets their relative hyperphagia and allows normalization of their weight (Weigle 
5 (1995) J. Clin. Invest. 96:2065-2070). 

Leptin circulates in the body at levels proportional to body fat content (Considine et al 
(1 996) New Eng J Med 334 :292-295) and enters the central nervous system (CNS) at levels 
proportional to the plasma level (Schwartz et al (1996) Nature Med 2 :589-593). Leptin receptors 
are expressed by brain neurons involved in energy intake (Baskin et al (1999) Diabetes 48 :828- 
1 0 833; Cheung et al (1997) Endocrinology 138:4489-4492) and administration of leptin into the brain 
reduces food intake (Weigle (1995) J. Clin. Invest. 96:2065-2070 ; Campfield et al (1995) Science 
269:546-549), whereas its deficiency increases food intake (Zhang et al (1994) Nature 372:425- 
432). 

Despite this clear evidence of leptin's role as an adiposity signal, with only a few 
1 5 exceptions the genes encoding leptin or its ob receptor have proved to be normal in obese human 
subjects (Kopelman et al (2000) Nature 404:635-643). Furthermore, and paradoxically, the plasma 
concentrations of leptin, are abnormally high in most obese human subjects (Considine et al (1996) 
New Eng J Med 334 :292-295). 

20 SUMMARY OF THE INVENTION 

The present invention results from a focusing of the research effort on the discovery of the 
mechanisms of leptin elimination. The most widely accepted working hypothesis is that the plasma 
levels of leptin are high in obese subjects because this hormone is produced by adipose tissue 
which is increased in obese subjects. In contrast, although not wishing to be limited by any 

25 particular theory, the inventors postulated that the concentrations of leptin are increased in obese 
individuals because the clearance of this hormone is reduced. The resulting high levels of leptin 
cause a leptin resistance syndrome. Thus, the treatment of obese subjects should not be based on 
increasing leptin levels, but in normalizing leptin levels. 

The lipolysis stimulated receptor (LSR) displays a high affinity for unmodified 

30 triglyceride-rich lipoproteins and is involved in the partitioning of dietary lipids among the liver, 
adipose tissue and muscle. The instant invention stems inter alia from studies of the role of LSR 
in modulating obesity. As part of the instant invention, leptin and the leptin fragment described 
herein were found to diminish the postprandial lipemic response in db Pas /db Pas mice which lack 
the leptin OB receptor, thereby showing that leptin signaling can be independent of the OB 

35 receptor. Further, the instant invention stems from the discovery that leptin increases the activity 
of LSR, binds directly to LSR, and that leptin binding leads to leptin degradation. Although not 
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wishing to be bound by a particular theory, the link between leptin signaling and LSR suggests 
the post-prandial lipemic response in db Pa */db Pcs mice is modulated through this pathway. 

In addition, the inventors have discovered that LSR is actually at least two receptors, one 
for triglyceride-rich lipoproteins, and one for leptin. The three subunits that make up LSR, 
5 a, p, and a', actually combine in at least two ways: (1) a and P together make up the LSR 
receptor for triglyceride-rich lipoproteins, and (2) a' is a necessary part of the LSR receptor for 
leptin, that may include P as well. Thus, it is now clear that assays can be designed for 
identifying modulators or receptors/binding partners/signalling cascade members that are 
specific for the triglyceride-related activity of LSR or for the leptin-related activity of LSR or 
10 both. 

Further, the invention features the discovery of a 22 amino acid region of human leptin 
that modulates LSR activity in vitro and in vivo in the same way as the intact human leptin, thus 
allowing the use of only this critical region in assays for modulators of the leptin-LSR 
interaction, and new leptin receptors and binding partners. The new leptin fragment can also be 

15 used in disease treatment since it is active in mice at a physiologically-relevant level. In 

addition, the homologous region from mouse leptin was found to inhibit LSR activity in the 
human system, and is thus an LSR antagonist of the invention as well as being a powerful tool 
for identifying further modulators (both inhibitory and stimulatory) of LSR activity. 

In a preferred aspect, the invention features a leptin polypeptide fragment that modulates 

20 the activity of LSR, comprising at least 4, but not more than 50 contiguous amino acids of any 
one of the leptin polypeptide sequences set forth in Figure 13, wherein said at least 4 and not 
more than 50 contiguous amaino acids comprise the leptin fragment central sequence. In 
preferred embodiments, the leptin polypeptide fragment comprises at least 10 but not more than 
50, at least 20 but not more than 40, or at least 20 but not more than 30 contiguous amino acids. 

25 Alternatively, the invention features a variant of a leptin polypeptide fragment that 

modulates the activity of LSR, consisting of a 22 contiguous amino acid sequence that is at least 
75% identical to the leptin fragment variable region of any one of the leptin polypeptide 
sequences set forth in Figure 13. In preferred embodiments, the variant of a leptin polypeptide 
fragment is 85% identical, or 95% identical to the leptin fragment variable region. Preferably the 

30 leptin fragments and variants are from human or mouse leptin. 

In a second aspect, the invention features, a chimeric oligonucleotide, comprising at least 
9 contiguous nucleotides from a sequence selected from the group consisting of SEQ ID NO:2, 
SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO: 12, SEQ ID NO:14, 
SEQ ID NO: 15, and SEQ ID NO: 16, wherein said at least 9 contiguous nucleotides comprise at 

35 least one amino acid codon selected from the group consisting of TTA, TTG, TCA, TCG, TAU, 
TAC, TGT, TGC, TGG, CAA, CAG, AGA, GAA, GAG, and GGA, and wherein a point 
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mutation is present in said codon such that said codon is a stop codon. Alternatively, the 
chimeric oligonucleotide comprises at least 9 contiguous nucleotides of SEQ ID NO:l , wherein 
said at least 9 contiguous nucleotides comprise a single nucleotide polymorphism selected from 
the group consisting of Al to A32. 
5 In a third aspect, the invention features a zinc finger protein, comprising a DNA binding 

domain that binds specifically to 18 nucleotides of a sequence at least 50% homologous to SEQ 
ID NO:l, wherein said 18 nucleotides comprise two fragments of 9 contiguous nucleotides, and 
wherein said fragments are separated by 0, 1, 2, or 3 nucleotides. In preferred embodiments, 
said sequence is at least 50% homologous to intronic sequences selected from the group 

10 consisting of 2357 to 3539, 3885 to 12162, 12283 to 15143, 15201 to 17764, 15912 to 19578, 
19753 to 19898, 19959 to 20055, 20188 to 20328, and 20958 to 21046 of SEQ ID NO:l, 
preferably to residues 2357 to 3539 of SEQ ID NO: 1 , or alternatively 5' untranslated regions 
such as the sequence 1 to 2356 of SEQ ID NO:l. In preferred embodiments, said protein further 
comprises a functional domain selected from the group consisting of a transcription repressor 

1 5 and a transcription initiator; preferably said repressor is a KRAB repressor and said initiator is a 
VP16 initiator. In other preferred embodiments, said protein further comprises a small molecule 
regulatory system, preferably said system is selected from the group consisting of a Tet system, 
RU486, and ecdysone. 

In a fourth aspect, the invention features polynucleotides encoding the leptin polypeptide 

20 fragments and variants of the invention, or polynucleotides encoding a zinc finger protein of the 
invention. 

In a fifth aspect, the invention features recombinant vectors comprising the 
polynucleotides encoding the leptin polypeptide fragments and variants of the invention, or 
polynucleotides or recombionant vectors encoding a zinc finger protein of the invention. In 

25 preferred embodiments, said vector is an adenovirus associated virus. 

In a sixth aspect, the invention features recombinant cells comprising the 
polynucleotides and recombinant vectors encoding the leptin polypeptide fragments and variants 
of the invention, or polynucleotides and recombinant vectors encoding zinc finger proteins of the 
invention. In preferred embodiments, the recombinant cell comprising the polynucleotides and 

30 recombinant vectors encoding leptin fragments and variants and zinc finger polypeptides of the 
invention, are transfected with at least one LSR polypeptide comprising a sequence at least 75% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ED NO:17, SEQ ID 
NO: 18, and SEQ ID NO: 19. Preferably, said transfected cell is stably transfected. Preferably, 

35 said cell is selected from the group consisting of PLC, CHO-K1, Hep3B, Hepa 1-6, and HepG2. 
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In a seventh embodiment, the invention features a pharmaceutical composition 
comprising the leptin polypeptide fragments and variants of the invention, or polynucleotides or 
recombinant vectors encoding a zinc finger protein of the invention, or chimeraplasts of the 
invention. 

5 In an eighth aspect, the invention features non-human mammals comprising 

polynucleotides and recombinant vectors encoding zinc finger proteins of the invention. 
Preferably, said vector is an adenovirus associated virus. 

In a ninth aspect, the invention features a method of treating or preventing an obesity- 
related disease or disorder comprising providing to an individual in need of such treatment a 

1 0 pharmaceutical composition comprising the leptin polypeptide fragments and variants of the 
invention. Preferably, said disease is congenital generalized lipodystrophy. Alternatively, the 
patient is provided a chimeric oligonucleotide of the invention or a polynucleotide or 
recombinant vector encoding a zinc finger protein of the invention. Preferably, said providing 
comprises a liposome, and preferably said vector is an adenovirus associated virus. In preferred 

1 5 embodiments, the obesity related disease or disorder is selcted from the group consisting of 

obesity, anorexia, cachexia, cardiac insufficiency, coronary insufficiency, stroke, hypertenison, 
atheromatous disease, atherosclerosis, high blood pressure, non-insulin-dependent diabetes, 
hyperlipidemia, hyperuricemia, and Syndrome X. Preferably the individual is an animal, 
preferably a mammal, most preferably a human. 

20 In a tenth aspect, the invention features a method of designing mimetics of a leptin 

fragment that modulates an activity of LSR, comprising: identifying critical interactions 
between one or more amino acids of said leptin fragment and LSR; designing potential mimetics 
to comprise said critical interactions; and testing said potential mimetics ability to modulate said 
activity as a means for designing said mimetics. Preferably, the leptin fragment consists of the 

25 leptin fragment variable region or the leptin fragment central sequence of any one of the leptin 
polypeptide sequences set forth in Figure 13. Alternatively, the leptin fragment is any one of the 
leptin fragments or variants of the invention. Preferably, the leptin fragment or variant is from 
human or mouse leptin. In preferred embodiments, the activity of 1SR is selected from the group 
consisting of leptin binding, leptin uptake, leptin degradation, triglyceride binding, triglyceride 

30 uptake, and triglyceride degradation. Preferably the critical interactions are selected from the 
group consisting of hydrogen bonding, covalent bonding, Van der Waals forces, steric 
hindrances, and hydrophobic interactions, and are identified using assays selected from the group 
consisting of NMR, X-ray crystallography, and computer modeling. 

In an eleventh aspect, the invention features a method of inhibiting the expression of at 

35 least one subunit of LSR, comprising providing to a cell a chimeric oligonucleotide of the 

invention that changes a amino acid codon to a stop codon. Preferably, the cell is selected from 
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the group consisting of PLC, CHO-K1, HepG2, Hepa 1-6, and Hep3B. Alternatively the cell is 
in a mammal, preferably a mouse, more preferably in a human, and is provided using a liposome. 

In a related aspect, the invention features a method of modulating the expression of at 
least one subunit of LSR, comprising providing to a cell a polynucleotide encoding a zinc finger 
5 protein of the invention. Preferably, said cell is selected from the group consisting of PLC, 
CHO-K1, HepG2, Hepa 1-6, and Hep3B. Alternatively, said cell is in an animal, preferably a 
mammal, and preferably said mammal is a mouse or a human. 

In a twelfth aspect, the invention features a method for selecting a compound useful for 
the treatment or prevention of an obesity-related disease or disorder, comprising: contacting a 

1 0 recombinant cell comprising a polynucleotide or recombinant vector encoding a zinc finger 
protein of the invention, and that optionally further comprises at least one LSR polypeptide 
comprising a sequence at least 75% identical to an amino acid sequence selected from the group 
consisting of SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ 
ID NO: 13, SEQ ID NO: 1 7, SEQ ID NO: 1 8, and SEQ ID NO: 1 9, with a candidate compound; 

1 5 and detecting a result selected from the group consisting of a modulation of an activity of the 
Lipolysis Stimulated Receptor and modulation of expression of the Lipolysis Stimulated 
Receptor; as a means for selecting said compound useful for the treatment or prevention of said 
obesity-related disease or disorder. In preferred embodiments, said contacting is in the presence 
of a ligand of said Lipolysis Stimulated Receptor. Preferably, said ligand is selected from the 

20 group consisting of cytokine, lipoprotein, free fatty acids, Apml, and Clq. Most preferably said 
cytokine is leptin, or a leptin polypeptide fragment or variant of the invention. Alternatively said 
free fatty acid is oleate. 

In preferred embodiments, said LSR activity is selected from the group consisting of 
binding of lipoproteins, uptake of lipoproteins, degradation of lipoproteins, binding of leptin, 

25 uptake of leptin, and degradation of leptin. Preferably said modulation is an increase in said 

activity, alternatively a decrease in activity. In other preferred embodiments, said expression is 
on the surface of said cell, and preferably said detecting comprises FACS. Preferably, said 
detecting further comprises antibodies that bind specifically to said LSR, wherein said LSR 
comprises an amino acid sequence at least 75% homologous to at least one of the sequences 

30 selected from the group consisting of SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:9, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 17, SEQ ID NO: 18, and SEQ ID NO: 19 
Most preferably, said antibodies bind specifically to a region of said LSR selected from the 
group consisting of an amino terminus, a carboxy terminus, a splice site, a cytokine binding site, 
a fatty acid binding site, a clathrin binding site, an apoprotein ligand binding site, a LI/LL motif, 

35 a RSRS motif, and a hydrophobic region. Preferably, said cell is selected from the group 
consisting of PLC, CHO-K1, Hep3B, Hepa 1-6, and HepG2. 
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In other preferred embodiments, said candidate compound is selected from the group 
consisting of peptides, peptide libraries, non-peptide libraries, peptoids, fatty acids, lipoproteins, 
medicaments, antibodies, and small molecules. Preferably, said obesity-related diseases and 
disorders are selected from the group consisting of obesity, anorexia, cachexia, cardiac 
5 insufficiency, coronary insufficiency, stroke, hypertension, atheromatous disease, 
atherosclerosis, high blood pressure, non-insulin-dependent diabetes, hyperlipidemia, 
hyperuricemia, and Syndrome X. 

In a thirteenth aspect, the invention features a method of selecting for genes that 
modulate an activity of the Lipolysis Stimulated Receptor, comprising : providing a retroviral 

10 gene library to cells that express said Lipolysis Stimulated Receptor; contacting said cells with a 
ligand of said Lipolysis Stimulated Receptor ; detecting a change in said activity of the Lipolysis 
Stimulated Receptor as a means for selecting for said genes. In preferred embodiments, said 
retroviral gene library comprises a cDNA library from tissues selected from the group consisting 
of liver and adipose. Preferably, said retroviral gene library further comprises a detectable 

1 5 marker protein selected from the group consisting of GFP, truncated CD2, and truncated CD4. 
In other preferred embodiments, the invention further comprises selecting said cells comprising 
the retroviral gene library for moderate expression of GFP; preferably said selecting of cells is by 
FACS. 

In other preferred embodiments, said ligand is selected from the group consisting of 
20 cytokine, lipoprotein, free fatty acids, Apml, and Clq. Most preferably said cytokine is leptin, 
or a leptin polypeptide fragment or variant of the invention. Alternatively said free fatty acid is 
oleate. 

In yet other preferred embodiments, preferably said detecting a change in said activity 
comprises FACS. Preferably, said detecting further comprises antibodies that bind specifically 

25 to said LSR, wherein said LSR comprises an amino acid sequence at least 75% homologous to at 
least one of the sequences selected from the group consisting of SEQ ID NO:3, SEQ ID NO:5, 
SEQ ED NO:7, SEQ ID NO:9, SEQ ED NO: 1 1 , SEQ ID NO: 1 3, SEQ ID NO: 1 7, SEQ ID NO: 1 8, 
and SEQ ID NO: 19 Most preferably, said antibodies bind specifically to a region of said LSR 
selected from the group consisting of an amino terminus, a carboxy terminus, a splice site, a 

30 cytokine binding site, a fatty acid binding site, a clathrin binding site, an apoprotein ligand 
binding site, a LI/LL motif, a RSRS motif, and a hydrophobic region. Preferably, said cell is 
selected from the group consisting of PLC, CHO-K1, Hep3B, Hepa 1-6, and HepG2. 

DETAILED DESCRIPTION OF THE INVENTION 



35 



LSR (Lipolysis Stimulated Receptor), which is described in PCT publication No WO 
IB98/01257 (hereby incoporated by reference herein in its entirety including any figures, tables, 
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or drawings), is expressed on the surface of hepatic cells, and is involved in the partitioning of 
dietary lipids between the liver and peripheral tissues, including muscles and adipose tissue. The 
LSR gene encodes, by alternative splicing, three types of subunits, LSR a, LSR a\ and LSR (3. 
The a' subunit specifically binds a cytokine, leptin, which activates LSR and is taken up and 
5 degraded. The invention is drawn inter alia to compounds that modulate the interaction 
between LSR and leptin useful in the treatment or prevention of obesity-related diseases and 
disorders. 

Definitions 

1 0 Before describing the invention in greater detail, the following definitions are set forth to 

illustrate and define the meaning and scope of the terms used to describe the invention herein. 

As used interchangeably herein, the terms "oligonucleotides", and "polynucleotides" 
include RNA, DNA, or RNA/DNA hybrid sequences of more than one nucleotide in either single 
chain or duplex form. The terms "nucleotide", "nucleotide sequence" and "nucleic acid" are 

1 5 used herein consistently with their use in the art, including to encompass "modified nucleotides" 
which comprise at least one modification, including by way of example and not limitation: (a) an 
alternative linking group, (b) an analogous form of purine, (c) an analogous form of pyrimidine, 
or (d) an analogous sugar. For examples of analogous linking groups, purines, pyrimidines, and 
sugars see for example PCT publication No. WO 95/04064. The polynucleotide sequences of 

20 the invention may be prepared by any known method, including synthetic, recombinant, ex vivo 
generation, or a combination thereof, as well as utilizing any purification methods known in the 
art. 

The terms polynucleotide construct, recombinant polynucleotide and recombinant 
polypeptide are used herein consistently with their use in the art. The terms "upstream" and 

25 "downstream" are also used herein consistently with their use in the art. The terms "base 

paired" and "Watson & Crick base paired" are used interchangeably herein and consistently with 
their use in the art. Similarly, the terms "complementary", "complement thereof , 
"complement", "complementary polynucleotide", "complementary nucleic acid" and 
"complementary nucleotide sequence" are used interchangeably herein and consistently with 

30 their use in the art. 

The term "purified" is used herein to describe a polynucleotide or polynucleotide vector 
of the invention that has been separated from other compounds including, but not limited to, 
other nucleic acids, carbohydrates, lipids and proteins (such as the enzymes used in the synthesis 
of the polynucleotide). Purified can also refer to the separation of covalently closed 

35 polynucleotides from linear polynucleotides, or vice versa, for example. A polynucleotide is 
substantially pure when at least about 50%, 60%, 75%, or 90% of a sample contains a single 
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polynucleotide sequence. In some cases this involves a determination between conformations 
(linear versus covalently closed). A substantially pure polynucleotide typically comprises about 
50, 60, 70, 80, 90, 95, 99% weight/weight of a nucleic acid sample. Polynucleotide purity or 
homogeneity may be indicated by a number of means well known in the art, such as agarose or 
5 polyacrylamide gel electrophoresis of a sample, followed by visualizing a single polynucleotide 
band upon staining the gel. For certain purposes higher resolution can be provided by using 
HPLC or other means well known in the art. 

Similarly, the term "purified" is used herein to describe a polypeptide of the invention 
that has been separated from other compounds including, but not limited to, nucleic acids, lipids, 

10 carbohydrates and other proteins. In some preferred embodiments, a polypeptide is substantially 
pure when at least about 50%, 60%, 75%, 85%, 90%, or 95% of a sample exhibits a single 
polypeptide sequence. In some preferred embodiments, a substantially pure polypeptide 
typically comprises about 50%, 60%, 70%, 80%, 90% 95%, or 99% weight/weight of a protein 
sample. Polypeptide purity or homogeneity is indicated by a number of methods well known in 

15 the art, such as agarose or polyacrylamide gel electrophoresis of a sample, followed by 
visualizing a single polypeptide band upon staining the gel. For certain purposes higher 
resolution can be provided by using HPLC or other methods well known in the art. 

Further, as used herein, the term "purified" does not require absolute purity; rather, it is 
intended as a relative definition. Purification of starting material or natural material to at least one 

20 order of magnitude, preferably two or three orders, and more preferably four or five orders of 
magnitude is expressly contemplated. Alternatively, purification may be expressed as "at least" a 
percent purity relative to heterologous polynucleotides (DNA, RNA or both) or polypeptides. As a 
preferred embodiment, the polynucleotides or polypeptides of the present invention are at least; 
10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% pure 

25 relative to heterologous polynucleotides or polypeptides. As a further preferred embodiment the 
polynucleotides or polypeptides have an "at least" purity ranging from any number, to the 
thousandth position, between 90% and 100% (e.g., at least 99.995% pure) relative to heterologous 
polynucleotides or polypeptides. Additionally, purity of the polynucleotides or polypeptides may 
be expressed as a percentage (as described above) relative to all materials and compounds other 

30 than the carrier solution. Each number, to the thousandth position, may be claimed as individual 
species of purity. 

The term "isolated" requires that the material be removed from its original environment 
(e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring 
polynucleotide or polypeptide present in a living animal is not isolated, but the same 
35 polynucleotide or DNA or polypeptide, separated from some or all of the coexisting materials in 
the natural system, is isolated. Such polynucleotide could be part of a vector and/or such 
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polynucleotide or polypeptide could be part of a composition, and still be isolated in that the 
vector or composition is not part of its natural environment. 

• Specifically excluded from the definition of "isolated*' are: naturally occurring 
chromosomes (e.g., chromosome spreads), artificial chromosome libraries, genomic libraries, and 
5 cDNA libraries that exist either as an in vitro nucleic acid preparation or as a 
transfected/transformed host cell preparation, wherein the host cells are either an in vitro 
heterogeneous preparation or plated as a heterogeneous population of single colonies. Also 
specifically excluded are the above libraries wherein a 5' EST makes up less than 5% of the 
number of nucleic acid inserts in the vector molecules. Further specifically excluded are whole cell 

10 genomic DNA or whole cell RNA preparations (including said whole cell preparations which are 
mechanically sheared or enzymatically digested). Further specifically excluded are the above 
whole cell preparations as either an in vitro preparation or as a heterogeneous mixture separated by 
electrophoresis (including blot transfers of the same) wherein the polynucleotide of the invention 
have not been further separated from the heterologous polynucleotides in the electrophoresis 

1 5 medium (e.g., further separating by excising a single band from a heterogeneous band population in 
an agarose gel or nylon blot). 

The term "primer" denotes a specific oligonucleotide sequence which is complementary 
to a target nucleotide sequence and used to hybridize to the target nucleotide sequence. A primer 
serves as an initiation point for nucleotide polymerization catalyzed by DNA polymerase, RNA 

20 polymerase, or reverse transcriptase. 

The term "probe" denotes a defined nucleic acid segment (or nucleotide analog segment, 
e.g., PNA as defined hereinbelow) which can be used to identify a specific polynucleotide 
sequence present in a sample, said nucleic acid segment comprising a nucleotide sequence 
complementary to the specific polynucleotide sequence to be identified. 

25 The term "polypeptide" refers to a polymer of amino acids without regard to the length 

of the polymer. Thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide. This term also does not specify or exclude post-expression modifications of 
polypeptides. For example, polypeptides that include the covalent attachment of glycosyl 
groups, acetyl groups, phosphate groups, lipid groups and the like are expressly encompassed by 

30 the term polypeptide. Also included within the definition are polypeptides which contain one or 
more analogs of an amino acid (including, for example, non-naturally occurring amino acids, 
amino acids which only occur naturally in an unrelated biological system, modified amino acids 
from mammalian systems etc.), polypeptides with substituted linkages, as well as other 
modifications known in the art, both naturally occurring and non-naturally occurring. 

35 Without being limited by theory, the compounds/polypeptides of the invention are 

believed to treat "diseases involving the partitioning of dietary lipids between the liver and 
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peripheral tissues The term "peripheral tissues" is meant to include muscle and adipose tissue. 
In preferred embodiments, the compounds/polypeptides of the invention partition the dietary 
lipids toward the muscle. In alternative preferred embodiments, the dietary lipids are partitioned 
toward the adipose tissue. In other preferred embodiments, the dietary lipids are partitioned 

5 toward the liver. In yet other preferred embodiments, the compounds/polypeptides of the 
invention increase or decrease the oxidation of dietary lipids, preferably free fatty acids (FFA) 
by the muscle. Dietary lipids include, but are not limited to triglycerides and free fatty acids. 

Preferred diseases believed to involve the partitioning of dietary lipids include obesity 
and obesity-related diseases and disorders such as atherosclerosis, heart disease, insulin 

10 resistance, hypertension, stroke, Syndrome X, and Type II diabetes. Type II diabetes-related 
complications to be treated by the methods of the invention include microangiopathic lesions, 
ocular lesions, and renal lesions. Heart disease includes, but is not limited to, cardiac 
insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders 
to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet 

15 other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS- 
related weight loss, neoplasia-related weight loss, anorexia, and bulimia. 

The term "obesity" as used herein is defined in the WHO classifications of weight 
(Kopelman (2000) Nature 404:635-643). Underweight is less than 18.5 (thin); Healthy is 18.5- 
24.9 (normal); grade 1 overweight is 25.0-29.9 (overweight); grade 2 overweight is 30.0-39.0 

20 (obesity); grade 3 overweight is greater than or equal to 40.0 BMI (morbid obesity). BMI is 
body mass index and is kg/m 2 . Waist circumference can also be used to indicate a risk of 
metabolic complications where in men a circumference of greater than or equal to 94 cm 
indicates an increased risk, and greater than or equal to 102 cm indicates a substantially 
increased risk. Similarly for women, greater than or equal to 88 cm indicates an increased risk, 

25 and greater than or equal to 88 cm indicates a substantially increased risk. The waist 

circumference is measured in cm at midpoint between lower border of ribs and upper border of 
the pelvis. Other measures of obesity include, but are not limited to, skinfold thickness which is 
a measurement in cm of skinfold thickness using calipers, and bioimpedance, which is based on 
the principle that lean mass conducts current better than fat mass because it is primarily an 

30 electrolyte solution; measurement of resistance to a weak current (impedance) applied across 
extremities provides an estimate of body fat using an empirically derived equation. 

The term "agent acting on the partitioning of dietary lipids between the liver and 
peripheral tissues'* refers to a compound or polypeptide of the invention that modulates the 
partitioning of dietary lipids between the liver and the peripheral tissues as previously described. 

35 Preferably, the agent increases or decreases the oxidation of dietary lipids, preferably free fatty 
acids (FFA) by the muscle. Preferably the agent decreases or increases the body weight of 
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individuals or is used to treat or prevent an obesity-related disease or disorder such as 
atherosclerosis, heart disease, insulin resistance, hypertension, stroke, Syndrome X, and Type II 
diabetes. Type II diabetes-related complications to be treated by the methods of the invention 
include, but are not limited to, microangiopathic lesions, ocular lesions, and renal lesions. Heart 
5 disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high 
blood pressure. Other obesity-related disorders to be treated by compounds of the invention 
include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the 
invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia. 

The terms "response to an agent acting on the partitioning of dietary lipids between the 

10 liver and peripheral tissues " refer to drug efficacy, including but not limited to, ability to 
metabolize a compound, to the ability to convert a pro-drug to an active drug, and to the 
pharmacokinetics (absorption, distribution, elimination) and the pharmacodynamics (receptor- 
related) of a drug in an individual. 

The terms "side effects to an agent acting on the partitioning of dietary lipids between 

15 the liver and peripheral tissues " refer to adverse effects of theraipy resulting from extensions of 
the principal pharmacological action of the drug or to idiosyncratic adverse reactions resulting 
from an interaction of the drug with unique host factors. "Side effects to an agent acting on the 
partitioning of dietary lipids between the liver and peripheral tissues " can include, but are not 
limited to, adverse reactions such as dermatologic, hematologic or hepatologic toxicities and 

20 further includes gastric and intestinal ulceration, disturbance in platelet function, renal injury, 
nephritis, vasomotor rhinitis with profuse watery secretions, angioneurotic edema, generalized 
urticaria, and bronchial asthma to laryngeal edema and bronchoconstriction, hypotension, and 
shock. 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides 
25 which are comprised of at least one binding domain, where an antibody binding domain is 

formed from the folding of variable domains of an antibody molecule to form three-dimensional 
binding spaces with an internal surface shape and charge distribution complementary to the 
features of an antigenic determinant of an antigen, and that allows an immunological reaction 
with the antigen. Antibodies include recombinant proteins comprising the antibody binding 
30 domains, as well as fragments, including Fab, Fab', F(ab)2, and F(ab')2 fragments. 

As used herein, an "antigenic determinant" is the portion of an antigen molecule, in this 
case an LSR polypeptide, that determines the specificity of the antigen-antibody reaction. An 
"epitope" refers to an antigenic determinant of a polypeptide. An epitope can comprise as few as 
3 amino acids in a spatial conformation which is unique to the epitope. Generally an epitope 
35 consists of at least 6 such amino acids, and more usually at least 8-10 such amino acids. 
Methods for determining the amino acids which make up an epitope include x-ray 
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crystallography, 2-dimensional nuclear magnetic resonance, and epitope mapping e.g. the 
Pepscan method described by H. Mario Geysen et al. 1984. Proc. Natl. Acad. Sci. U.S.A. 
81:3998-4002; PCT Publication No. WO 84/03564; and PCT Publication No. WO 84/03506. 
The term "compound" as used herein refers to molecules, either organic or inorganic, 
5 that can be tested for activity in an assay. Preferably, compounds have a low molecular weight 
of less than 500 kda, some compounds can have a molecular weight between 500 and 1500, other 
compounds may have a molecular weight of at least 1500 kda. In addition, compounds of 
interest preferably have a desired activity at a low concentration, e.g. a compound that is active 
at a concentration of 1 ng/mL or less, is generally preferred over one that is active at 1 ng/mL to 

10 100 ng/mL, or one that is active only at concentrations greater than 100 ng/mL. Examples of 
compounds to be tested in the assays herein include: peptides, peptide libraries, non-peptide 
libraries, antibodies, and peptoids. 

The term "activity" as used herein refers to a measurable result of the interaction of 
molecules. For example, some LSR activities include leptin binding, leptin uptake, leptin 

1 5 degradation, as well as triglyceride binding, triglyceride uptake, and triglyceride degradation. 
Some exemplary methods of measuring these activities are provided herein. 

The term "modulate" as used herein refers to the ability of a compound to change an 
activity in some measurable way as compared to an appropriate control. As a result of the 
presence of compounds in the assays, activities can increase (e.g. there could be increased levels 

20 of leptin binding), or "decrease" (e.g. there could be decreased levels of leptin binding) as 

compared to controls in the absence of these compounds. Preferably, an increase in activity is at 
least 25%, more preferably at least 50%, most preferably at least 100% compared to the level of 
activity in the absence of the compound. Similarly, a decrease in activity is preferably at least 
25%, more preferably at least 50%, most preferably at least 100% compared to the level of 

25 activity in the absence of the compound. A compound that increases a known activity is an 
"agonist". One that decreases, or prevents, a known activity is an "antagonist". 

The term "monitoring" as used herein refers to any method in the art by which an 
activity can be measured. For each of the activities in the assays of the invention, exemplary 
methods are provided in the Examples section. 

30 The term "providing" as used herein refers to any means of adding a compound or 

molecule to something known in the art. Examples of providing can include the use of pipets, 
pipettmen, syringes, needles, tubing, guns, etc. This can be manual or automated. It can include 
transfection by any mean or any other means of providing nucleic acids to dishes, cells, tissue, 
cell-free systems and can be in vitro or in vivo. Methods are provided in the Examples section as 

35 examples. 
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The term "LSR-related diseases and disorders" as used herein refers to any disease or 
disorder or condition comprising an aberrant functioning of LSR, or a subunit(s) of LSR, to 
include aberrant levels of expression of LSR, or a subunit(s) of LSR (either increased or 
decreased), aberrant activity of LSR (either increased or decreased), and aberrant interactions 

5 with ligands or binding partners (either increased or decreased). By "aberrant" is meant a 
change from the type, or level of activity seen in normal cells, tissues, or individuals, or seen 
previously in the cell, tissue, or individual prior to the onset of the illness. 

The term "cosmetic treatments" is meant to include treatments with compounds or 
polypeptides of the invention that increase or decrease the body mass of an individual where the 

10 individual is not clinically obese or clinically thin. Thus, these individuals have a body mass 

index (BMI) below the cut-off for clinical obesity (e.g. below 25 kg/m 2 ) and above the cut-off for 
clinical thinness (e.g. above 18.5 kg/m 2 ). In addition, these individuals are preferably healthy 
(e.g. do not have an obesity-related disease or disorder of the invention). "Cosmetic treatments" 
are also meant to encompass, in some circumstances, more localized increases in adipose tissue, 

15 for example, gains or losses specifically around the waist or hips, or around the hips and thighs, 
for example. These localized gains or losses of adipose tissue can be identified by increases or 
decreases in waist or hip size, for example. 

The term "preventing" as used herein refers to administering a compound prior to the 
onset of clinical symptoms of a disease or conditions so as to prevent a physical manifestation of 

20 aberrations associated with obesity or LSR. 

The term "treating" as used herein refers to administering a compound after the onset of 
clinical symptoms. 

The term "in need of treatment" as used herein refers to a judgment made by a caregiver 
(e.g. physician, nurse, nurse practitioner, etc in the case of humans; veterinarian in the case of 
25 animals, including non-human mammals) that an individual or animal requires or will benefit 
from treatment. This judgment is made based on a variety of factors that are in the realm of a 
care giver's expertise, but that include the knowledge that the individual or animal is ill, or will 
be ill, as the result of a condition that is treatable by the compounds of the invention. 

The term "perceives a need for treatment" refers to a sub-clinical determination that an 
30 individual desires to reduce weight for cosmetic reasons as discussed under "cosmetic treatment" 
above. The term "perceives a need for treatment" in other embodiments can refer to the decision 
that an owner of an animal makes for cosmetic treatment of the animal. 

The term "individual" as used herein refers to a mammal, including animals, preferably 
mice, rats, other rodents, rabbits, dogs, cats, swine, cattle, sheep, horses, or primates, most 
35 preferably humans. 
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The term "non-human animal" refers to any non-human vertebrate, birds and more 
usually mammals, preferably primates, animals such as swine, goats, sheep, donkeys, horses, 
cats, dogs, rabbits or rodents, more preferably rats or mice. Both the terms "animal" and 
"mammal" expressly embrace human subjects unless preceded with the term "non-human". 
5 The terms "percentage of sequence identity" and "percentage homology" are used 

interchangeably herein to refer to comparisons among polynucleotides and polypeptides, and are 
determined by comparing two optimally aligned sequences over a comparison window, wherein 
the portion of the polynucleotide or polypeptide sequence in the comparison window may 
comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does 

10 not comprise additions or deletions) for optimal alignment of the two sequences. The percentage 
is calculated by determining the number of positions at which the identical nucleic acid base or 
amino acid residue occurs in both sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of positions in the window of comparison 
and multiplying the result by 100 to yield the percentage of sequence identity. Homology is 

1 5 evaluated using any of the variety of sequence comparison algorithms and programs known in 
the art. Such algorithms and programs include, but are by no means limited to, TBLASTN, 
BLASTP, FASTA, TFASTA, and CLUSTALW (Pearson and Lipman, 1988, Proc. Natl. Acad. 
Sci. USA 85(8):2444-2448; Altschul et al., 1990, J. Mol. Biol. . 215(3):403-410) Thompson et 
al., 1994, Nucleic Acids Res. 22(2):4673-4680; Higgins et al., 1996, Methods Enzymol. 266:383- 

20 402; Altschul et al., 1990, J. Mol. Biol. 215(3):403-410; Altschul et al., 1993, Nature Genetics 
3:266-272). In a particularly preferred embodiment, protein and nucleic acid sequence 
homologies are evaluated using the Basic Local Alignment Search Tool ("BLAST") which is 
well known in the art (see e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 87:2267- 
2268; Altschul et al., 1990, J. Mol. Biol. 215:403-410; Altschul et al., 1993, Nature Genetics 

25 3:266-272; Altschul et al., 1997, Nuc. Acids Res. 25:3389-3402. In particular, five specific 
BLAST programs are used to perform the following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein 
sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 
30 database; 

(3) BLASTX compares the six-frame conceptual translation products of a query 
nucleotide sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence 
database translated in all six reading frames (both strands); and 

35 (5) TBLASTX compares the six-frame translations of a nucleotide query sequence 

against the six-frame translations of a nucleotide sequence database. 
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The BLAST programs identify homologous sequences by identifying similar segments, 
which are referred to herein as "high-scoring segment pairs," between a query amino or nucleic 
acid sequence and a test sequence which is preferably obtained from a protein or nucleic acid 
sequence database. High-scoring segment pairs are preferably identified (i.e., aligned) by means 
5 of a scoring matrix, many of which are known in the art. Preferably, the scoring matrix used is 
the BLOSUM62 matrix (Gonnet et al., 1992, Science 256:1443-1445; Henikoff and Henikoff, 
1993, Proteins 17:49-61. Less preferably, the PAM or PAM250 matrices may also be used (see, 
e.g., Schwartz and Dayhoff, eds., 1978, Matrices for Detecting Distance Relationships: Atlas of 
Protein Sequence and Structure, Washington: National Biomedical Research Foundation). The 

1 0 BLAST programs evaluate the statistical significance of all high-scoring segment pairs 

identified, and preferably selects those segments which satisfy a user-specified threshold of 
significance, such as a user-specified percent homology. Preferably, the statistical significance 
of a high-scoring segment pair is evaluated using the statistical significance formula of Karlin 
(see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 87:2267-2268). 

15 By way of example and not limitation, procedures using conditions of high stringency 

are as follows: Prehybridization of filters containing DNA is carried out for 8 h to overnight at 
65°C in buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 
0.02% Ficoll, 0.02% BSA, and 500 ng/mL denatured salmon sperm DNA. Filters are 
hybridized for 48 h at 65 °C, the preferred hybridization temperature, in prehybridization 

20 mixture containing 100 ng/mL denatured salmon sperm DNA and 5-20 X 10 6 cpm of 

32 P-labeled probe. Alternatively, the hybridization step can be performed at 65 °C in the 
presence of SSC buffer, 1 x SSC corresponding to 0.15 M NaCl and 0.05 M Na citrate. 
Subsequently, filter washes can be done at 37 °C for 1 h in a solution containing 2 x SSC, 
0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1 X SSC at 50 °C for 45 

25 min. Alternatively, filter washes can be performed in a solution containing 2 x SSC and 0.1 % 
SDS, or 0.5 x SSC and 0.1% SDS, or 0.1 x SSC and 0.1% SDS at 68 °C for 15 minute 
intervals. Following the wash steps, the hybridized probes are detectable by autoradiography. 
Other conditions of high stringency that may be used are well known in the art (see, for 
example, Sambrook et al., 1989; and Ausubel et al., 1989). These hybridization conditions are 

30 suitable for a nucleic acid molecule of about 20 nucleotides in length. A person of ordinary 
skill in the art will realize that the hybridization conditions described above are to be adapted 
according to the length of the desired nucleic acid following techniques well known to the one 
skilled in the art. Suitable hybridization conditions may for example be adapted according to 
the teachings disclosed in the book of Hames and Higgins (1985) or in Sambrook et al.(1989). 

35 Variants 
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It will be recognized by one of ordinary skill in the art that some amino acids of the 
polypeptide sequences of the present invention can be varied without significant effect on the 
structure or function of the protein; there will be critical amino acids in the polypeptide sequence 
that determine activity. Thus, the invention further includes variants of polypeptides. Such 
5 variants include polypeptide sequences with one or more amino acid deletions, insertions, 
inversions, repeats, and substitutions either from natural mutations or human manipulation 
selected according to general rules known in the art so as to have little effect on activity. 
Guidance concerning how to make phenotypically silent amino acid substitutions is provided 
below. 

10 There are two main approaches for studying the tolerance of an amino acid sequence to 

change (See, Bowie, J. U. et al. 1990). The first method relies on the process of evolution, in 
which mutations are either accepted or rejected by natural selection. The second approach uses 
genetic engineering to introduce amino acid changes at specific positions of a cloned gene and 
selections or screens to identify sequences that maintain functionality. 

15 These studies have revealed that proteins are surprisingly tolerant of amino acid 

substitutions and indicate which amino acid changes are likely to be permissive at a certain 
position of the protein. For example, most buried amino acid residues require nonpolar side 
chains, whereas few features of surface side chains are generally conserved. Other such 
phenotypically silent substitutions are described by Bowie et al. (supra) and the references cited 

20 therein. 

Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and Phe; interchange of the hydroxyl residues Ser 
and Thr; exchange of the acidic residues Asp and Glu; substitution between the amide residues 
Asn and Gin; exchange of the basic residues Lys and Arg; and replacements among the aromatic 

25 residues Phe, Tyr. In addition, the following groups of amino acids generally represent 

equivalent changes: (1) Ala, Pro, Gly, Glu, Asp, Gin, Asn, Ser, Thr; (2) Cys, Ser, Tyr, Thr; (3) 
Val, lie, Leu, Met, Ala, Phe; (4) Lys, Arg, His; (5) Phe, Tyr, Trp, His. 

Similarly, amino acids in polypeptide sequences of the invention that are essential for 
function can also be identified by methods known in the art, such as site-directed mutagenesis or 

30 alanine-scanning mutagenesis (See, e.g., Cunningham et al. 1989). The latter procedure 
introduces single alanine mutations at every residue in the molecule. The resulting mutant 
molecules are then tested for obesity-related activity using assays as described above. Of special 
interest are substitutions of charged amino acids with other charged or neutral amino acids that 
may produce proteins with highly desirable improved characteristics, such as less aggregation. 

35 Aggregation may not only reduce activity but also be problematic when preparing pharmaceutical 
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formulations, because aggregates can be immunogenic, (See, e.g., Pinckard, et al., 1967; Robbins, 
et ah, 1987;and Cleland, et al., 1993). 

Thus, the fragment, derivative, analog, or homolog of the polypeptide of the present 
invention may be, for example: (i) one in which one or more of the amino acid residues are 
5 substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino 
acid residue) and such substituted amino acid residue may or may not be one encoded by the 
genetic code: or (ii) one in which one or more of the amino acid residues includes a substituent 
group: or (iii) one in which the polypeptide is fused with another compound, such as a compound 
to increase the half-life of the polypeptide (for example, polyethylene glycol): or (iv) one in 
1 0 which the additional amino acids are fused to the above form of the polypeptide, such as an IgG 
Fc fusion region peptide or leader or secretory sequence or a sequence which is employed for 
purification of the above form of the polypeptide or a pro-protein sequence. Such fragments, 
derivatives and analogs are deemed to be within the scope of those skilled in the art from the 
teachings herein. 

15 A further embodiment of the invention relates to a polypeptide which comprises the 

amino acid sequence of a polypeptide having an amino acid sequence which contains at least one 
conservative amino acid substitution, but not more than 50 conservative amino acid substitutions, 
not more than 40 conservative amino acid substitutions, not more than 30 conservative amino 
acid substitutions, and not more than 20 conservative amino acid substitutions. Also provided 

20 are polypeptides which comprise the amino acid sequence of polypeptide, having at least one, but 
not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino acid substitutions. 

Another specific embodiment of a modified polypeptide of the invention is a polypeptide 
that is resistant to proteolysis, for example a polypeptide in which a -CONH- peptide bond is 
modified and replaced by one or more of the following: a (CH2NH) reduced bond; a (NHCO) 

25 retro inverso bond; a (CH2-0) methylene-oxy bond; a (CH2-S) thiomethylene bond; a (CH2CH2) 
carba bond; a (CO-CH2) cetomethylene bond; a (CHOH-CH2) hydroxyethylene bond); a (N-N) 
bound; a E-alcene bond; or a -CH=CH- bond. Thus, the invention also encompasses a 
polypeptide or a fragment or a variant thereof in which at least one peptide bond has been 
modified as described above. 

30 In addition, amino acids have chirality within the body of either L or D. In some 

embodiments it is preferable to alter the chirality of the amino acids in the polypeptides of the 
invention in order to extend half-life within the body. Thus, in some embodiments, one or more 
of the amino acids are preferably in the L configuration. In other embodiments, one or more of 
the amino aicds are preferably in the D configuration. 



35 
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I . Leptin Polynucleotides of the Invention 

Polynucleotides have been designed that encode a LSR-binding/activating/modulating 
portion of the leptin protein. This region was identified by a comparison of the human and 
murine amino acid sequences, and its activity was confirmed in vitro and in vivo (See 
5 Examples 1-8). The recombinant polynucleotide encoding the LSR-activating leptin fragment 
can be used in a variety of ways, including: (1) to express the polypeptide in recombinant cells 
so as to be purified and used as described below, (2) to express the polypeptide in cells as part 
of an assay system to discover modulators of the leptin/LSR interaction, and (3) as part of a 
gene surgery where the fragment itself can be used in treatment and/or prevention of obesity- 

1 0 related diseases and disorders and modulating body mass. 

The invention relates to the polynucleotides encoding a leptin polypeptide fragment 
described in the Examples (7 & 8), and variants and fragments thereof as described herein in 
Leptin Polypeptides of the Invention (section II), as well as to variants and fragments of the 
polynucleotides that encode these polypeptides. Preferably, polynucleotides are purified, 

1 5 isolated and/or recombinant. 

In other preferred embodiments, variants of the leptin polynucleotides encoding leptin 
polypeptides as described herein in Leptin Polypeptides of the Invention are envisioned. 
Variants of polynucleotides, as the term is used herein, are polynucleotides whose sequence 
differs from a reference polynucleotide. A variant of a polynucleotide may be a naturally 

20 occurring variant such as a naturally occurring allelic variant, or it may be a variant that is not 
known to occur naturally. Such non-naturally occurring variants of the polynucleotide may be 
made by mutagenesis techniques, including those applied to polynucleotides, cells or 
organisms. Generally, differences are limited so that the nucleotide sequences of the reference 
and the variant are closely similar overall and, in many regions, identical. 

25 Variants of leptin polynucleotides according to the invention may include, without being 

limited to, nucleotide sequences which are at least 90% (preferably at least 95%, more preferably 
at least 99%, and most preferably at least 99.5%) identical to a polynucleotide that encodes a 
leptin polypeptide of the invention, or to any polynucleotide fragment of at least 8 (preferably at 
least 15, more preferably at least 25, and most preferably at least 45) consecutive nucleotides of 

30 a polynucleotide that encodes a polypeptide of the invention. 

Nucleotide changes present in a variant polynucleotide are preferably silent, which 
means that they do not alter the amino acids encoded by the polynucleotide. However, 
nucleotide changes may also result in amino acid substitutions, additions, deletions, fusions and 
truncations in the polypeptide encoded by the reference sequence. The substitutions, deletions or 

35 additions may involve one or more nucleotides. Alterations in the leptin coding regions of the 
invention may produce conservative or non-conservative amino acid substitutions, deletions or 
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additions in the encoded protein. Preferably, the nucleotide substitutions result in non- 
conservative amino acid changes and more preferably in conservative amino acid changes in the 
encoded polypeptide. 

In cases where the nucleotide substitutions result in one or more amino acid changes, 
5 preferred leptin polypeptides include those that retain the same activities and activity levels as 
the leptin polypeptide encoded by the reference polynucleotide sequence, as well as those where 
the level of one or more activities is increased, and alternatively where the level of one or more 
activities is decreased or even absent. Leptin polypeptide activities of the invention are 
described herein in the Examples in more detail (1-8, 10 & 14), but include LSR binding leading 
10 to the uptake and degradation of leptin, as well as the upregulation of LSR receptors that bind, 
uptake and degrade triglycerides. Examples of assays to determine the presence or absence of 
specific leptin activities and the level of the activity(s) are also described herein. 

By "retain the same activities" is meant that the activity measured using the polypeptide 
encoded by the variant leptin polynucleotide in assays is at least 75% (preferably at least 85%, 
1 5 more preferably at least 95%, most preferably at least 98%) and not more than 125% (preferably 
not more than 1 15%, more preferably not more than 105%, most preferably not more than 102%) 
of the activity measured using the leptin polypeptide encoded by the reference sequence. 

By the activity being "increased" is meant that the activity measured using the 
polypeptide encoded by the variant leptin polynucleotide in assays is at least 125% (preferably at 
20 least 150%, more preferably at least 200%, most preferably at least 500%) of the activity 
measured using the leptin polypeptide encoded by the reference sequence. 

By the activity being "decreased" is meant that the activity measured using the 
polypeptide encoded by the variant leptin polynucleotide in assays is not more than 75% 
(preferably not more than 50%, more preferably not more than 25%, most preferably not more 
25 than 10%) of the activity measured using the leptin polypeptide encoded by the reference 
sequence. 

By the activity being "absent" is meant that the activity measured using the polypeptide 
encoded by the variant leptin polynucleotide in assays is less than 25%, alternatively less than 
10% (preferably less than 5%, more preferably less than 2%, most preferably less than 1%) of 
30 the activity measured using the leptin polypeptide encoded by the reference sequence. 

A polynucleotide fragment is a polynucleotide having a sequence that entirely is the 
same as part, but not all, of a given nucleotide sequence, preferably the nucleotide sequence 
encoding a leptin polypeptide that binds and activates LSR, and variants thereof as described 
above, and the complements of these polynucleotides. Such fragments may be "free-standing", 
35 Le. not part of or fused to other polynucleotides, or they may be comprised within a single larger 
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non-leptin polynucleotide of which they form a part or region. However, several fragments may 
be comprised within a single larger polynucleotide. 

Optionally, such fragments may consist of a contiguous span that ranges in length from 
8, 10, 12, 15, 18 or 20 to 25, 35, 40, 50, 60, 70, 80, 90, 100, 1 10, 120, 130, 140, or 150 
5 nucleotides, or be specified as being 12, 15, 18, 20, 25, 35, 40, 50, 60, 70 , 80 , 90, 10, 1 10, 120, 
130, 140, or 150 nucleotides in length. 

A preferred embodiment of the invention includes isolated, purified, or recombinant 
polynucleotides consisting of a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 
70, 80, 90, 100, 110, 120, 130, 140, or 150 nucleotides encoding a leptin polypeptide of the 

1 0 invention, or the complements thereof, wherein said contiguous span encodes a fragment of 

leptin that retains the same activities and activity levels as the leptin polypeptide encoded by the 
reference polynucleotide sequence, or encodes a fragment of leptin where the level of one or 
more activities is increased, or alternatively where the level of one or more activities is decreased 
or even absent as described above. 

15 An additional preferred embodiment of the invention includes isolated, purified, or 

recombinant polynucleotides consisting of a contiguous span of 8 to 50 nucleotides of a leptin 
polypeptide of the invention, or their variants, or the complements thereof, wherein said 
contiguous span encodes a fragment of leptin that retains the same activities and activity levels 
as the leptin polypeptide encoded by the reference polynucleotide sequence, or encodes a 

20 fragment of leptin where the level of one or more activities is increased, or alternatively where 
the level of one or more activities is decreased or even absent as described above. Any of the 
above-described fragments may be comprised within a larger non-leptin polynucleotide 
fragment. 

25 I. Leptin Polypeptide Fragments of the Invention 

Leptin polypeptide fragments that bind/activate/modulate LSR have been identified 
(Examples 1-8). This region was identified by a comparison of the human and murine leptin 
amino acid sequences, and its activity confirmed in vitro and in vivo (See Examples 1-8). The 
advantages to having identified a leptin fragment responsible for leptin activity, include its use 

30 (1) as part of an assay system to discover leptin receptors and binding partners (in association 
with LSR for example), (2) as a lead molecule for the design of other compounds able to 
modulate LSR activity, and (3) as part of a treatment and/or prevention for obesity-related 
diseases and disorders. Knowledge of specific polypeptides involved is especially useful since it 
allows its use in assay systems (rather than the entire protein) and keeps the cost down (easily 

35 synthesized). In addition, a peptide can be expected to easily crystallize in the correct 

conformation to allow structure-function studies to design other small molecule activators. 
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Finally, use of just the active portion in treatment should increase the chances of the peptide 
remaining active and potentially decreasing side-effects. 

Furthermore, in the process of identifying the "active" portion of human leptin for 
human cells, a corresponding inhibitory portion of mouse leptin for human cells was identified. 
5 Comparisons between the two highly similar fragments will enable the identification of 

important residues for both increasing the activity of LSR and inhibiting the activity of LSR. 
This will be useful both in competitive assays for inhibitors and activators of LSR, and also for 
treatments in mammals and animals where inhibition of LSR is desired. 

The invention relates to leptin polypeptides as well as to variants, fragments, analogs and 

10 derivatives of the leptin polypeptides described herein, including modified leptin polypeptides. 
Preferred embodiments of the invention feature a leptin polypeptide that consists of a sequence 
described in Example 10, or variants, fragments, analogs, or derivatives thereof. Preferably the 
polypeptides are, purified, isolated and/or recombinant. 

In other preferred embodiments, the invention features a leptin polypeptide fragment that 

15 modulates the activity of LSR, comprising at least 4, but not more than 50 contiguous amino 
acids of any one of the leptin polypeptide sequences set forth in Figure 13, wherein said at least 
4 and not more than 50 contiguous amino acids comprise the leptin fragment central sequence. 
By the « leptin fragment central sequence » as used herein is meant the four variable amino acids 
of the active leptin peptide identified in Example 10 by sequence comparisons and molecular 

20 modeling. These residues comprise ETLD (SEQ ID NO:40) and QKPE (SEQ ID NO:41) for the 
human and mouse sequences, respectively, in Fig. 13. Preferably, the leptin polypeptide 
fragment comprises at least 10, but not more than 50, more preferably at least 15 but not more 
than 40, or at least 20 and not more than 40, or most prefereably at least at least 15 but not more 
than 30, or 20 but not more than 30 contiguous amino acids of any one of the leptin polypeptide 

25 sequences set forth in Figure 13, wherein said contiguous amino acids comprise the leptin 

polypeptide variable region. Preferably the leptin polypeptide fragment is human or mouse, but 
most preferably human, or a derivative or variant thereof. 

Variant leptin polypeptides of the invention may be 1) ones in which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino acid residue and 

30 such substituted amino acid residue may or may not be one encoded by the genetic code, or 2) 
ones in which one or more of the amino acid residues includes a substituent group, or 3) ones in 
which a modified leptin polypeptide is fused with another compound, such as a compound to 
increase the half-life of the polypeptide (for example, polyethylene glycol), or 4) ones in which 
the additional amino acids are fused to a modified leptin polypeptide, such as a leader or 

35 secretory sequence or a sequence which is employed for purification of the modified leptin 
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polypeptide or a pre-protein sequence. Such variants are deemed to be within the scope of those 
skilled in the art. 

Amino acid changes present in a variant polypeptide may be non-conservative amino 
acid changes but more preferably are conservative amino acid changes. In cases where there are 
5 one or more amino acid changes, preferred leptin polypeptides include those that retain the same 
activities and activity levels as the reference leptin polypeptide sequence, as well as those where 
the level of one or more activities is increased, and alternatively where the level of one or more 
activities is decreased or even absent. Assays for determining leptin polypeptide activities of the 
invention are described herein in the Examples (1-8 & 13) in more detail, but include LSR 
10 binding leading to the uptake and degradation of leptin, as well as the upregulation of LSR 
receptors that bind, uptake and degrade triglyceride-rich lipoproteins. Examples of assays to 
determine the presence or absence of specific leptin activities and the level of the activity(s) are 
also described herein. Defhitions of activities are provided in "Leptin Polynucleotides of the 
Invention" (section I). 

15 In preferred embodiments, the invention features a variant of a leptin polypeptide 

fragment that modulates the activity of LSR, consisting of a 22 contiguous amino acid sequence 
that is at least 75% identical to the leptin fragment variable region of any one of the leptin 
polypeptide sequences set forth in Figure 13. By the « leptin fragment variable region » as used 
herein is meant the region of 22 amino acids that is shaded in figure 13 for all the species in the 

20 alignment. Preferably, the 22 contiguous amino acid sequence is at least 85% identical to the 

leptin fragment variable region of any one of the leptin polypeptide sequences set forth in Figure 
13, more preferably 90% identical, most preferably 95% identical and optionally 100% identical. 
Preferably the sequence is human or mouse, and most preferably human. 

In yet other preferred embodiments, the invention features a variant of a leptin 

25 polypeptide fragment that modulates the activity of LSR, consisting of a 22 contiguous amino 
acid sequence, wherein at least 16 of the 22 amino acids are identical to the leptin fragment 
variable region of any one of the leptin polypeptide sequences set forth in Figure 13. Preferably, 
at least 1 8 of the 22 amino acids are identical to the leptin fragment variable region of any one of 
the leptin polypeptide sequences set forth in Figure 13, more preferably 20 of the 22 are 

30 identical, most preferably all of the amino acids are identical. Preferably the sequence is human 
or mouse, and most preferably human. 

A polypeptide fragment is a polypeptide having a sequence that is entirely the same as 
part, but not all, of a given polypeptide sequence, preferably a polypeptide encoded by a leptin 
gene and variants thereof. Such fragments may be "free-standing", i.e. not part of or fused to 

35 other polypeptides, or they may be comprised within a single larger non-leptin polypeptide of 
which they form a part or region. However, several fragments may be comprised within a single 
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larger polypeptide. As representative examples of polypeptide fragments of the invention, there 
may be mentioned those which have from about 4, 5, 6, 7, 8, 9 or 10 to 15, 10 to 20, 15 to 40, or 
30 to 55 amino acids long. Preferred are those fragments containing at least one amino acid 
substitution or deletion in a leptin polypeptide. 
5 The present invention is particularly focused on a set of variant leptin polypeptides 

and the fragments thereof. A preferred set of polypeptides of the invention include isolated, 
purified, or recombinant polypeptides comprising a contiguous span of at least 3 (preferably at 
least 6, more preferably at least 10, most preferably at least 15) amino acids of any of the 
leptin fragment variable regions of the sequences provided in Figure 13. 

10 

II. Zinc Finger Proteins of the Invention 

Zinc finger proteins of the Cys2His2 type are malleable DNA binding proteins that can 
be designed to bind diverse sequences, and that typically contain 3 zinc finger domains. The 
inventors contemplate the use of any zinc finger protein engineered to bind the DNA of interest, 

1 5 specifically. Although six-fingered proteins have been described to target unique sites within 
the genome (International Publication WO 98/543 1 1) proteins with different numbers of fingers 
that are engineered to bind specifically to the genome are also included in the invention. The 
six-fingered proteins described in WO 98/543 1 1, bind two 9 contiguous base pair fragments 
(separated by 0, 1, 2, or 3 nucleotides) of DNA or RNA in a sequence specific fashion, and can 

20 be used to regulate gene transcription. The zinc finger proteins of the invention also include 
those that are designed to bind sequences a greater distance apart and thereby confer greater 
specificity with fewer (or the same number, or more) "fingers". Methods for designing the zinc 
finger proteins of the invention, as well as for determining the sequences to which the zinc 
finger proteins bind, are described in International Publication WO 98/543 1 1 entitled "Zinc 

25 Finger Protein Derivatives and Methods Therefor". 

For one embodiment of the invention, zinc finger proteins have been designed that will 
bind to the 5' regulatory regions and selected introns of LSR and thereby inhibit or augment the 
transcription of endogenous LSR as described herein (Example 12). Exogenous LSR that is 
introduced into the cell without these regulatory regions or introns (cDNA) will be expressed 

30 normally. This can be useful in vitro both as a research tool to study the role of the various LSR 
components in leptin signaling and triglyceride-rich lipoprotein uptake and degradation, for 
example, and as part of an assay to discover modulators of LSRlep and LSRtg activity. 
Therefore, in currently preferred embodiments, zinc finger proteins are not designed to bind to 
the exons of LSR. However, in circumstances where no endogenous nor exogenously- 

35 introduced LSR activity is desired in a cell, for example, zinc finger proteins designed to bind to 
LSR exons could be useful. 
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The invention features a zinc finger protein, comprising a DNA binding domain that binds 
specifically to 18 nucleotides of a sequence at least 50% homologous to SEQ ID NO: 1, wherein said 1 8 
nucleotides comprise two fragments of 9 contiguous nucleotides, and wherein said fragments are 
separated by 0, 1, 2, or 3 nucleotides. In preferred embodiments, the zinc finger protein binds to 
5 sequences that are at least 50% homologous to the sequence of the introns of SEQ ED NO: 1 . Preferably, 
the sequence is at least 50% homologous to the sequence of the first intron of SEQ ID NO: 1 . In other 
preferred embodiments, the zinc finger protein binds specifically to 18 nucleotides of a sequence that is 
75% identical, 80%, 85%, or 90% identical, or most preferably 99 to 100% identical to SEQ ID NO :1, 
the introns of SEQ ID NO : 1 , or preferably the first intron of SEQ ID NO : 1 . 

10 In preferred embodiments of the invention, the zinc finger protein of the invention further 

comprises a functional domain selected from the group consisting of a transcription repressor and a 
transcription initiator. These repressors and initiators can be any that are known in the art. Preferably, 
the repressor is a KRAB repressor and the initiator is a VP 16 initiator. In highly preferred 
embodiments, the protein further comprises a small molecule regulatory system that can be any known 

15 in the art; however, the system is preferably selected from the group consisting of a Tet system, RU486, 
and ecdysone. 

It is envisioned that zinc finger proteins could be designed to bind to any 18 or more contiguous 
base pairs of a sequence at least 50%, preferably 75%, more preferably 90%, most preferably 95% 
identical to the 5' regulatory region (for example, residues 1-2000 of SEQ ID NO: 1) or any of the 

20 introns of LSR (for example, 2357 to 3539, 3885 to 12162, 12283 to 15143, 15201 to 17764, 15912 to 
19578, 19753 to 19898, 19959 to 20055, 20188 to 20328, and 20958 to 21046 of SEQ ID NO:l), and 
more preferably residues 2357 to 3539 of SEQ ID NO: 1 . In particular, introns within 3,000 base pairs 
of the LSR start site are preferred, for example introns 1 through 3. 

Guidance is available for determining optimal base pair stretches for zinc finger protein 

25 binding, and for determining what zinc finger amino acids will bind to what DNA sequences 
(WO 98/543 1 1). This information has been used to design an algorithm for designing zinc 
finger proteins available from Sangamo Biosciences. However, as described in WO 98/5431 1, 
zinc finger proteins for binding a given piece of DNA can be identified by screening or 
"panning" libraries of zinc finger proteins with the DNA sequence. Zinc finger libraries can be 

30 made, for example, by randomly mutating genes encoding known zinc finger proteins (WO 

98/5431 1). The effectiveness of the zinc finger protein identified by the panning procedure can 
then be assessed in the E. coli method described in WO 98/5431 1 (co-transfection of genes 
encoding the zinc finger protein and the gene of which the DNA sequence makes up a part). 
The effectiveness of the zinc finger protein for inhibiting LSR expression can be further tested 

35 using the assay systems described in the Examples (1-8); in particular the use of FACS 
following staining with an LSR specific antibody and quantitative PCR will be useful. 
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In preferred embodiments, addition of the zinc finger protein preferably inhibits LSR 
transcription completely, or inhibits LSR translation completely. By "inhibits transcription 
completely" is meant that the level of transcription following addition of the zinc finger protein 
is preferably below the level of detection by the assay used as compared to control cells. The 
5 assay used may be a Northern blot, or any other assay that measures RNA expression, such as 
quantitative PCR. Alternatively, the level of transcription of LSR may be significantly reduced. 
By "significantly reduced" is meant that the amount of RNA is preferably reduced at least 2- 
fold, more preferably at least 5-fold, and most preferably at least 10-fold compared to the level 
RNA prior to the addition of the zinc finger protein, or the level in control cells. 

10 Similarly, by "inhibits translation completely" is meant that LSR protein is preferably 

below the level of detection by the assay used compared with control cells. The assay used may 
be a Western blot, or dot blot, or other type of immunoassay for example, or any other assay 
known in the art to be used to measure or detect the presence of proteins, such as FACS with 
fluorescent antibodies to LSR. Alternatively, the level of translation of LSR may be 

15 significantly reduced. By "significantly reduced" is meant the amount of protein present is 
preferably reduced at least 2-fold, more preferably at least 5-fold, most preferably at least 10- 
fold compared to the level of protein present prior to the addition of zinc finger protein, or in 
control cells. 

Highly preferred sequences to be used for designing zinc finger proteins include, 

20 residues 1841 to 1860, 1880 to 1898, 1918 to 1945, 1951 to 1973, and 3362 to 3382 of human 
LSR (SEQ ID NO:l) and of the homologous regions in genes coding for LSR proteins of other 
species, preferably including mouse and rat LSR. The genomic sequences encoding LSR from 
other species can be identified by methods well-known in the art. 

These zinc finger proteins can also be useful in vivo both as part of an assay system in 

25 animal models to discover modulators of LSRlep (at least a', may include P and/or a) and 
LSRtg (at least a, may include p and/or a') activity, as well as in gene surgery in which 
transcription of endogenous LSR is inhibited as part of the treatment for an obesity-related 
disease or disorder. This could be useful in a case where the LSR message was being over- 
expressed, or incorrectly expressed (mutated), for example. A potential therapy would include 

30 providing this zinc finger protein alone, in cases of simple over-expression, or in conjunction 
with other appropriate components of LSR if the cellular LSR was mutated. These proteins 
could be targeted to the appropriate cells (those with LSR) by using liposomes, for example, 
with leptin or another LSR binding protein in the liposome membrane. 

In an alternative embodiment of the invention, zinc finger proteins are designed to bind 

35 to the 5' regulatory regions of LSR and thereby increase the transcription of endogenous LSR. 
Typically, within the 5' regulatory region of genes are promoters as well as other regulatory 
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elements. Binding of zinc finger proteins to certain regions of the DNA may serve to facilitate 
binding of the initiation complex and thus transcription of the gene. For instance, where some 
unusual folding prevents access to the promoter region, if a zinc finger protein were to bind the 
DNA upstream such that the folding were prevented, then the promoter would have greater 
5 access and enhanced transcription should result. Alternatively, it may be possible to design a 
zinc finger protein that binds the promoter region directly, thereby initiating transcription. 

In these and other circumstances, zinc finger binding proteins designed to bind stretches 
of DNA in the 5'regulatory region as described above can be tested for their ability to enhance 
transcription of LSR. Thus, in preferred embodiments, addition of the zinc finger protein 

1 0 preferably significantly increases LSR transcription, or significantly increases LSR translation. 
By "significantly increases LSR transcription" is meant that the level of transcription following 
addition of the zinc finger protein is preferably increased at least 2-fold, more preferably at least 
5 -fold, and most preferably at least 10-fold compared to the level RNA prior to the addition of 
the zinc finger protein. The assay used may be a Northern blot, or any other assay that measures 

1 5 RNA expression. Alternatively, if the starting level of RNA transcription is below the level of 
detection by the assay used, "significantly increases LSR transcription" may mean that the level 
of transcription of LSR may become detectable on the addition of the zinc finger binding 
protein. 

Similarly, by "significantly increases LSR translation" is meant that the level of 

20 translation following addition of the zinc finger protein is preferably increased at least 2-fold, 
more preferably at least 5-fold, and most preferably at least 10-fold compared to the level of 
translation prior to the addition of the zinc finger protein. The assay used may be a Western 
blot, or dot blot, or other type of immunoassay for example, or any other assay known in the art 
to be used to measure or detect the presence of proteins. Alternatively, if the starting level of 

25 LSR protein is below the level of detection by the assay used, "significantly increases LSR 
translation" may mean that LSR protein may become detectable after the addition of the zinc 
finger binding protein. 

These zinc finger proteins can be useful in vivo in gene surgery in which transcription of 
endogenous LSR is enhanced as part of the treatment for an obesity-related disease or disorder. 

30 This can be envisioned in a situation where higher levels of the LSR protein are thought to be 
advantageous for the patient clinically. For example, increased expression of LSR could be 
advantageous when the LSR gene is normal, but is expressed at lower than normal levels, or 
when it is expressed at normal levels, but does not function as efficiently as it should in clearing 
triglycerides from the bloodstream, or when some other abnormality results in abnormally high 

35 levels of triglycerides and an increased amount of LSR protein is necessary to clear them. 
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In a further alternative embodiment of the invention, zinc finger proteins are designed to 
bind to any sequence of 18 or more contiguous base pairs of LSR mRNA and thereby inhibit 
translation of LSR. In preferred embodiments, expression of all three forms of LSR are 
inhibited by the zinc finger protein. In an alternative embodiment, zinc finger proteins are 
5 designed to specifically inhibit expression of the LSR a, a', or p subunit individually, or to 
inhibit both the LSR a and a' subunits. All three forms of LSR can be inhibited by zinc finger 
proteins targeted to mRNA fragments transcribed from exons one through 3 and exon 6 to the 
end. The a subunit can be targeted with zinc finger proteins designed to bind in exon 4. The a' 
subunit can be targeted with zinc finger proteins designed to bind to the splice site between exon 

10 3 and exon 5. The P subunit can be targeted with zinc finger proteins designed to bind to the 
splice site between exon 3 and exon 6. Both the a and a' subunits can be targeted with zinc 
finger proteins designed to bind to exon 5. 

These zinc finger proteins would be useful for many of the uses previously described for 
zinc finger proteins binding to and inhibiting or increasing transcription of LSR DNA. 

1 5 Similarly the definitions for inhibiting or increasing LSR transcription and tests for the desired 
zinc finger proteins and methods for designing and making them would be as previously 
described. In addition, for all of the zinc fingers described, it should be remembered that the 
system can be further controlled by addition of a small molecule control system (for example the 
Tet-responsive system, or RU486, or ecdysone) to the cell. This allows greater control/greater 

20 finesse for an in vitro assay system, in particular, but can be used in vivo as well. The basic idea 
is to provide the zinc finger with part of the Tet system integrated upstream such that 
transcription of the zinc finger protein can be regulated by the addition of an outside element, 
for example Dox or Tc. These methods are well-known to those in the art. 

25 III. Polynucleotides Encoding Zinc Finger Polypeptides of the Invention 

The invention also features polynucleotides that encode the zinc finger polypeptides of 

the invention described above. In one method of identifying the desired zinc finger 

polypeptides of the invention, libraries are screened (panned) for those clones expressing a zinc 

finger protein that binds to the desired nucleotide sequence. Frequently, multiple clones are 
30 identified that express zinc finger proteins that bind to the nucleotide sequence. All the variant 

polynucleotides that code for the zinc finger polypeptide(s) that bind to the desired sequence 

are also part of the present invention. 

Variants of polynucleotides, as the term is used here, are polynucleotides whose 

sequence differs from a reference polynucleotide; in this case a reference polynucleotide is the 
35 polynucleotide that is ultimately chosen to be used. Thus, the variant of the polynucleotide 

would frequently be the result of mutagenesis techniques as described in WO 98/5431 1. 
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Generally, differences are limited so that the nucleotide sequences of the reference and the 
variant are closely similar overall and, in many regions, identical. 

Nucleotide changes present in a variant polynucleotide are preferably silent, which 
means that they do not alter the amino acids encoded by the polynucleotide. However, 
5 nucleotide changes may also result in amino acid substitutions, additions, deletions, fusions and 
truncations in the polypeptide encoded by the reference sequence. The substitutions, deletions or 
additions may involve one or more nucleotides. Alterations in the zinc finger polypeptide coding 
regions of the invention may produce conservative or non-conservative amino acid substitutions, 
deletions or additions in the encoded protein. Preferably, the nucleotide substitutions result in 

10 non-conservative amino acid changes and more preferably in conservative amino acid changes in 
the encoded polypeptide. 

In cases where the nucleotide substitutions result in one or more amino acid changes, 
preferred zinc finger polypeptides include those that retain the same activities and activity levels 
as the zinc finger polypeptide encoded by the reference polynucleotide sequence, as well as those 

15 where the level of one or more activities is increased, and alternatively where the level of one or 
more activities is decreased or even absent. Zinc finger polypeptide activities of the invention 
and methods for testing are described above. 

A polynucleotide fragment is a polynucleotide having a sequence that entirely is the 
same as part, but not all, of a given nucleotide sequence, preferably the nucleotide sequence 

20 encoding a zinc finger polypeptide, and variants thereof, as described above, and the 

complements of these polynucleotides. Such fragments may be "free-standing", i.e. not part of 
or fused to other polynucleotides, or they may be comprised within a single larger polynucleotide 
of which they form a part or region. However, several fragments may be comprised within a 
single larger polynucleotide. Optionally, such fragments may consist of a contiguous span that 

25 ranges in length from 8, 10, 12, 15, 18 or 20 to 25, 35, 40, 50, or 60 nucleotides, or be specified 
as being 12, 15, 18, 20, 25, 35, 40, 50, or 60 nucleotides in length. 

IV. Chimeric Oligonucleotides of the Invention 

Chimeraplasty is a technique used to change the nucleotide sequence of DNA of cells 

30 and of animals (Science 285 :3 16-318 (1999)). It can be used to create or to correct mutations, 
usually point mutations, that have an effect on the protein coding sequence. The technique relies 
on hybrid molecules of DNA and RNA called chimeras that contain DNA with a mutation in its 
sequence (compared to the target sequence in the cell) flanked by RNA that perfectly mirrors the 
flanking target gene sequence. The target gene sequence is thought to be modified through the 

35 action of the celPs DNA repair machinery as a result of the pairing of the target DNA with the 
chimera containing the mutated sequence. 
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In the present invention, the advantages to using chimeraplasty to modify LSR include : 
(1) ease of creating cells lacking LSR polypeptides for use in assays or gene surgery; (2) 
specifically blocking production of the a subunit or the a and a 7 subunits for use in assays or in 
gene surgery; and (3) the ability to correct defects in the LSR gene in cells in vitro and in vivo 
5 for use in gene surgery. Chimeraplasty has been shown to be effective for correcting (or 
creating) mutations in cells in vitro and in vivo in animals (Cole-Strauss, et al. Science 273 : 
13861389 (1996) ; Alexeev and Yoon Nature Biotechnology 16 : 1343-1346 (1998) ; Kren et al 
Nature Medecine 4 : 285-290 (1998) ; Yoon et al Proc Natl. Acad. Sci. USA 93 : 2071-2076 
(1996) ; Xiang et al J Mol Med 75 : 829-825 ( 1997), hereby encorporated by reference herein in 
1 0 their entirety including any figures, drawings, or tables). Chimeraplasty is particularly useful in 
cases of point mutations that need to be corrected to allow either expression or function of the 
protein. 

Chimeraplasty apparently works through the cell's own DNA repair system to correct the 
targeted gene. Although the gene is not corrected in 100% of the cells following transfection in 

1 5 vitro or introduction into the animal in vivo, the genes in enough of the cells have been found to 
be changed to permit a clinically detectable change. This could, in fact, be beneficial in the LSR 
system where it is unlikely that you would ever want to completely prevent LSR expression. 
However, reduction in LSR expression might be advantageous in some obesity-related diseases 
and disorders. In particular, specific reduction in any one or more of the a, a', or P subunits 

20 could be advantageous. 

The invention features a chimeric oligonucleotide, comprising at least 9 contiguous nucleotides 
from a sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO: 10, SEQ ED NO: 12, SEQ ID NO: 14, SEQ ID NO: 15, and SEQ ID NO: 16, 
wherein said at least 9 contiguous nucleotides comprise at least one amino acid codon selected from the 

25 group consisting of TTA, TTG, TCA, TCG, TAU, TAC, TGT, TGC, TGG, CAA, CAG, AG A, GAA, 
GAG, and GGA, and wherein a point mutation is present in said codon such that said codon is a stop 
codon. In preferred embodiments, the sequence is selected from the group consisting of Exon 1, Exon 4 
and Exon 5 from SEQ ID NO : 1 and homologous sequences from mouse or rat, preferably mouse. 
Another embodiment of the invention features chimeraplast LSR polynucleotides, where 

30 the polynucleotide comprises at least 7 (preferably at least 13, more preferably at least 25, most 
preferably at least 35 nucleotides of the LSR gene (or its complement), and where the DNA 
portion of the chimera comprises a point mutation such that instead of coding for an amino acid, 
it now codes for a termination codon. Thus, substitution of this nucleotide for the nucleotide 
present in the endogenous LSR gene, results in a stop codon being created at the site. The other 

35 nucleotides present in both the DNA and RNA portions of the chimera are 100% complementary 
to the flanking regions of the endogenous LSR gene. The DNA portion of the chimera is at least 
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3 consecutive nucleotides in length, preferably at least 5 consecutive nucleotides in length, 
optionally at least 7 or at least 1 1 nucleotides in length. The point mutation is preferably the 
middle nucleotide (n ; alternatively n+1, or n-1 ; less preferably n+2, or n-2 ; n+3, or n-3, etc.) of 
the DNA part of the chimera when the DNA portion has an odd number of nucleotides (AGnCT, 
5 AnGCT, AGCnT, for example), or the n+1 or n-1 positions (less preferably n+2, or n-2 ; n+3, or 
n-3, etc.) when the sequence has an even number of nucleotides (AnCT, AcnT, for example). 
The KNA portion of the chimera is at least 4 consecutive nucleotides in length, preferably at 
least 10 consecutive nucleotides in length, more preferably at least 20 consecutive nucleotides in 
length, and most preferably at least 30 consecutive nucleotides in length. The RNA portion of 

10 the chimera flanks the DNA portion of the chimera, preferably with an equal number of 

nucleotides on each side of the DNA sequence (x ; when the number on RNA residues is even), 
less preferably with x+1 on the upstream side and x-1 on the downstream side or alternatively 
x+1 on the downstream side and x-1 on the upstream side ; even less preferably with x+2 on the 
upstream side and x-2 on the downstream side or alternatively x+2 on the downstream side and 

1 5 x-2 on the upstream side, and so on. Similarly, when the number of RNA residues is odd, there 
are either x+1 on the upstream side and x-1 on the downstream side or alternatively x+1 on the 
downstream side and x-1 on the upstream side of the DNA; less preferably there are x+2 on the 
upstream side and x-2 on the downstream side or alternatively x+2 on the downstream side and 
x-2 on the upstream side, and so on. In some cases, particularly when the point mutation is not 

20 in the center of the DNA part of the chimera, the number of residues of RNA flanking the DNA 
is preferably not equal on both sides. In some cases it is preferred that there are more RNA 
residues on one side than the other so as to have the point mutation be located at the center of the 
chimera, or at least n+1 or n-1 from the center of the chimera, less preferably n+2, or n-2 from 
the center, etc. Sequences that encode stop codons include TAA, TAG, and TGA. Therefore, 

25 sequences encoding the amino acids leucine (TTA or TTG), serine (TCA or TCG), tyrosine 

(TAU or TAC), cysteine (TGT or TGC), tryptophan (TGG), glutamine (CAA or CAG), arginine 
(AGA), glutamate (GAA or GAG), or glycine (GGA), for example, can be changed to one of the 
stop codons by a single polynucleotide exchange. The preferred stop codon is TGA. The exact 
design of the chimeras will depend on the particular sequence to be mutated, but guidance has 

30 been given in the papers listed above and in the Examples herein. In general, however, the 
sequence should be at least 14 nucleotides in length (preferably 18, more preferably 25, most 
preferably 30) to ensure specificity to the desired sequence. Preferably, the amino acid to be 
mutated to a termination codon is located at the 5' end of the coding sequence, preferably within 
the first exon, and preferably is the first amino acid that can be mutated in this way after the first 

35 ATG or most preferably the second ATG. Amino acids to be mutated to stop all LSR expression 
should not be selected from Exon 4 or Exon 5, since exon 4 is not present in the a' subunit, and 
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neither Exon 4 nor Exon 5 is present in the p subunit. The success of a chimeraplast in 
preventing LSR expression can be tested using the techniques described herein, to include 
screens for the presence of the mRNA by Northern blot, for example, and for the protein by 
Western blot, for example. 
5 Alternatively, in some preferred embodiments it is preferable to stop expression of the 

LSR a subunit only. To do this, the amino acid to be mutated is preferably located in Exon 4 of 
LSR, since this Exon is not present in the a' or p subunits. In other preferred embodiments it is 
preferable to prevent expression of both a and a' subunits, but not the p subunit. To do this, the 
amino acid to be mutated is preferably located in Exon 5 of LSR, since this exon is present in 

1 0 both a and a' subunits, but not the P subunit. 

In another embodiment, the invention feaures chimeraplast LSR polynucleotides, where 
the polynucleotide comprises at least 7 (preferably at least 13, more preferably at least 25, most 
preferably at least 35 nucleotides of the LSR gene (or its complement), and where the DNA 
portion of the chimera comprises one of the alleles of the single nucleotide polymorphisms 

1 5 (SNPs) described in U.S. Provisional Application No. 60/1 1 9, 592, entitled « Polymorphic 
Markers of the LSR Gene » by Blumenfeld, Bougueleret, and Bihain, filed February 10, 1999 
and indicated in Table A. Preferably, the SNP's are selected from the group consisting of Al, 
A2, A3, A4, A5, A6, A7, A8, A9, A10, Al 1, A12, A13, A14, A15, A16, A17, A18, A19, A20, 
A21, A22, A23, A24, A25, A26, A27, A28, A29, A30, A31, AND A32. The SNPs may be in 

20 either coding or non-coding regions of the LSR gene. Some SNPs in the coding region result in 
amino acid changes that may affect the activity of LSR. However, the majority of the SNPs do 
not code for amino acid changes. These nucleotide changes can also modulate the activity of 
LSR in a variety of ways, for example by interfering with the binding of a regulatory molecule 
that influences the splicing of the introns, particularly where there is differential splicing 

25 depending on the subunit to be expressed or by affecting the binding of promoters or the function 
of other regulatory sequences in the 5' and 3' regions of the gene. Changes in the expression of 
various subunits, or the levels of expression of LSR in general, can have profound effects on the 
obesity of patients. 

VI. Recombinant Vectors of the Invention 

30 The term "vector" is used herein to designate either a circular or a linear DNA or RNA 

molecule, that is either double-stranded or single-stranded, and that comprises at least one 
polynucleotide of interest that is sought to be transferred in a cell host or in a unicellular or 
multicellular host organism. 

The present invention relates to recombinant vectors comprising any one of the 

35 polynucleotides described herein. 
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The present invention encompasses a family of recombinant vectors that comprise 
polynucleotides encoding leptin polypeptides of the invention, polynucleotides encoding zinc 
finger proteins of the invention, and chimeraplastic polynucleotides of the invention as described 
herein. 

5 In a first preferred embodiment, a recombinant vector of the invention is used to amplify 

the inserted polynucleotide in a suitable cell host, this polynucleotide being amplified every time 
that the recombinant vector replicates. The inserted polynucleotide can be one that encodes 
leptin polypeptides of the invention or zinc finger polypeptides of the invention, or a 
chimeraplast polynucleotide. 

10 A second preferred embodiment of the recombinant vectors according to the invention, 

consists of expression vectors comprising either a polynucleotide encoding leptin polypeptides of 
the invention or zinc finger proteins of the invention, or both. Within certain embodiments, 
expression vectors are employed to express a leptin polypeptide of the invention, preferably a 
modified leptin polypeptide described in the present invention, which can be then purified and, 

1 5 for example, be used in screening assays or as a treatment for obesity-related diseases. In other 
embodiments, expression vectors are employed to express a zinc finger protein of the invention, 
preferably one that inhibits LSR expression or expression of specific subunits of LSR as 
described in the present invention, which can be then purified and, for example, be used in 
screening assays or as a treatment for obesity-related diseases. In other embodiments, the 

20 expression vectors are used for constructing transgenic animals and also for gene surgery, in 
particular, expression vectors containing a polynucleotide encoding zinc finger proteins of the 
invention. 

Expression requires that appropriate signals are provided in the vectors, said signals 
including various regulatory elements, such as enhancers/promoters from both viral and 

25 mammalian sources, that drive expression of the genes of interest in host cells. Dominant drug 
selection markers for establishing permanent, stable, cell clones expressing the products are 
generally included in the expression vectors of the invention, as they are elements that link 
expression of the drug selection markers to expression of the polypeptide. 

More particularly, the present invention relates to expression vectors which include 

30 nucleic acids encoding a leptin polypeptide fragment of the invention, or a modified leptin 
polypeptide as described herein, or variants or fragments thereof, under the control of a 
regulatory sequence selected among the leptin regulatory polynucleotides, or alternatively under 
the control of an exogenous regulatory sequence. The present also relates to expression vectors 
which include nucleic acids encoding a zinc finger polypeptide of the invention, or a modified 

35 zinc finger polypeptide as described herein, or variants or fragments thereof, under the control of 
an exogenous regulatory sequence. 



10 
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Consequently, preferred expression vectors of the invention are selected from the group 
consisting of : (a) a leptin regulatory sequence and driving the expression of a coding 
polynucleotide operably linked thereto; (b) a leptin polypeptide coding sequence of the 
invention, operably linked to regulatory sequences allowing its expression in a suitable cell host 
and/or host organism. Other preferred expression vectors of the invention comprise a zinc finger 
polypeptide coding sequence of the invention, operably linked to regulatory sequences allowing 
its expression in a suitable cell host and/or host organism. 

Some of the elements which can be found in the vectors of the present invention are 
described in further detail in the following sections. 



1) General features of the expression vectors of the invention : 

A recombinant vector according to the invention comprises, but is not limited to, a YAC 
(Yeast Artificial Chromosome), a BAC (Bacterial Artificial Chromosome), a phage, a phagemid, 
a cosmid, a plasmid, or even a linear DNA molecule which may consist of a chromosomal, non- 

1 5 chromosomal, semi-synthetic or synthetic DNA. Such a recombinant vector can comprise a 
transcriptional unit comprising an assembly of : 

(1) a genetic element or elements having a regulatory role in gene expression, for 
example promoters or enhancers. Enhancers are cis-acting elements of DNA, usually from about 
10 to 300 bp in length that act on the promoter to increase the transcription. 

20 (2) a structural or coding sequence which is transcribed into mRNA and eventually 

translated into a polypeptide, said structural or coding sequence being operably linked to the 
regulatory elements described in (1); and 

(3) appropriate transcription initiation and termination sequences. Structural units 
intended for use in yeast or eukaryotic expression systems preferably include a leader sequence 

25 enabling extracellular secretion of translated protein by a host cell. Alternatively, when a 
recombinant protein is expressed without a leader or transport sequence, it may include a N- 
terminal residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

Generally, recombinant expression vectors will include origins of replication, selectable 

30 markers permitting transformation of the host cell, and a promoter derived from a highly 

expressed gene to direct transcription of a downstream structural sequence. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and termination 
sequences, and preferably a leader sequence capable of directing secretion of the translated 
protein into the periplasmic space or the extracellular medium. In a specific embodiment 

35 wherein the vector is adapted for transfecting and expressing desired sequences in mammalian 
host cells, preferred vectors will comprise an origin of replication in the desired host, a suitable 



WO 01/21647 



35 



PCT/IBOO/01470 



promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation sites, 
splice donor and acceptor sites, transcriptional termination sequences, and 5*-flanking non- 
transcribed sequences. DNA sequences derived from the SV40 viral genome, for example S V40 
origin, early promoter, enhancer, splice and polyadenylation sites may be used to provide the 
5 required non-transcribed genetic elements. 

2) Regulatory elements 
Promoters 

The suitable promoter regions used in the expression vectors according to the present 
10 invention are chosen taking into account the cell host in which the heterologous gene has to be 
expressed. The particular promoter employed to control the expression of a nucleic acid 
sequence of interest is not believed to be important, so long as it is capable of directing the 
expression of the nucleic acid in the targeted cell. Thus, where a human cell is targeted, it is 
preferable to position the nucleic acid coding region adjacent to and under the control of a 
1 5 promoter that is capable of being expressed in a human cell, such as, for example, a human or a 
viral promoter. 

A suitable promoter may be heterologous with respect to the nucleic acid for which it 
controls the expression or alternatively can be endogenous to the native polynucleotide 
containing the coding sequence to be expressed. Additionally, the promoter is generally 
20 heterologous with respect to the recombinant vector sequences within which the construct 
promoter/coding sequence has been inserted. 

Promoter regions can be selected from any desired gene using, for example, CAT 
(chloramphenicol transferase) vectors and more preferably pKK232-8 and pCM7 vectors. 
Preferred bacterial promoters are the Lad, LacZ, the T3 or T7 bacteriophage RNA polymerase 
25 promoters, the gpt, lambda PR, PL and trp promoters (EP 0036776), the polyhedrin promoter, or 
the plO protein promoter from baculovirus (Kit Novagen) (Smith et al. (1983) Mol. Cell. Biol. 
3:2156-2165.; O'Reilly et al., 1992, Baculovirus expression vectors : a Laboratory Manual. W.H. 
Freeman and Co., New York the lambda PR promoter or also the trc promoter. 

Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and 
30 late SV40, LTRs from retrovirus, and mouse metallothionein-L. Selection of a convenient vector 
and promoter is well within the level of ordinary skill in the art. 

The choice of a promoter is well within the ability of a person skilled in the field of 
genetic egineering. For example, one may refer to the book of (Sambrook, J., Fritsch, E.F., and 
T. Maniatis. (1989), Molecular Cloning: A Laboratory Manual 2ed. Cold Spring Harbor 
35 Laboratory, Cold Spring Harbor, New York) )or also to the procedures described by (Fuller S. 
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A. et al. (1996) Immunology in Current Protocols in Molecular Biology, Ausubel et ah, Eds, 
John Wiley & Sons, Inc., USA). 



Other regulatory elements 
5 Where a cDNA insert is employed, one will typically desire to include a polyadenylation 

signal to effect proper polyadenylation of the gene transcript. The nature of the polyadenylation 
signal is not believed to be crucial to the successful practice of the invention, and any such 
sequence may be employed such as human growth hormone and SV40 polyadenylation signals. 
Also contemplated as an element of the expression cassette is a terminator. These elements can 
1 0 serve to enhance message levels and to minimize read through from the cassette into other 
sequences. 

The vector containing the appropriate DNA sequence as described above, more 
preferably LSR gene inhibitory or activating polynucleotide, a polynucleotide encoding a leptin 
polypeptide or both of them, can be utilized to transform an appropriate host to allow the 
1 5 expression of the desired polypeptide or polynucleotide. 

3) Selectable markers 

Such markers would confer an identifiable change to the cell permitting easy 
identification of cells containing the expression construct. The selectable marker genes for 
20 selection of transformed host cells are preferably dihydrofolate reductase or zeocin, hygromycin 
or neomycin resistance for eukaryotic cell culture, TRP1 for 5. cerevisiae or tetracycline, 
rifampicin or ampicillin resistance in E. coli 9 or levan saccharase for mycobacteria, this latter 
marker being a negative selection marker. 



25 4) Preferred vectors 

Bacterial vectors 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and a bacterial origin of replication derived from commercially 
available plasmids comprising genetic elements of pBR322 (ATCC 37017). Such commercial 
30 vectors include, for example, pKK223-3 (Pharmacia, Uppsala, Sweden), and GEM1 (Promega 
Biotec, Madison, WI, USA). 

Large numbers of other suitable vectors are known to those of skill in the art, and are 
commercially available, such as the following bacterial vectors : pQE70, pQE60, pQE-9 
(Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16A, pNH18A, 
35 pNH46A (Stratagene); ptrc99a, pKK223-3, pKJC233-3, pDR540, pRIT5 (Pharmacia); pWLNEO, 
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pSV2CAT, pOG44, pXTl, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia); pQE- 
30 (QIAexpress). 

Baculovirus vectors 

5 A suitable vector for the expression polypeptides of the invention is a baculovirus vector 

that can be propagated in insect cells and in insect cell lines. A specific suitable host vector 
system is the pVL1392/1393 baculovirus transfer vector (Pharmingen) that is used to transfect 
the SF9 cell line (ATCC N°CRL 171 1) which is derived from Spodoptera frugiperda. 

Other suitable vectors for the expression of a leptin polypeptide in a baculovirus 
1 0 expression system include those described by (Chai H. et al. (1993), Biotechnol AppL 

Biochem. 18:259-273; Vlasak R. et al. (1983), Eur. J. Biochem. 135:123-126; Lenhard T. et al. 
(1996), Gene. 169:187-190). 

Viral vectors 

15 In one specific embodiment, the vector is derived from an adenovirus. Preferred 

adenovirus vectors according to the invention are those described by Feldman and Steg (1996) or 
Ohno et al. (1994). Another preferred recombinant adenovirus according to this specific 
embodiment of the present invention is the human adenovirus type 2 or 5 (Ad 2 or Ad 5) or an 
adenovirus of animal origin ( French patent application N° FR-93 .05954). 

20 Retrovirus vectors and adeno-associated virus vectors are generally understood to be the 

recombinant gene delivery systems of choice for the transfer of exogenous polynucleotides in 
vivo , particularly to mammals, including humans. These vectors provide efficient delivery of 
genes into cells, and the transferred nucleic acids are stably integrated into the chromosomal 
DNA of the host. 

25 Particularly preferred retroviruses for the preparation or construction of retroviral in vitro or in 
vitro gene delivery vehicles of the present invention include retroviruses selected from the group 
consisting of Mink-Cell Focus Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis 
virus and Rous Sarcoma virus. Particularly preferred Murine Leukemia Viruses include the 
4070A and the 1504A viruses, Abelson (ATCC No VR-999), Friend (ATCC No VR-245), Gross 

30 (ATCC No VR-590), Rauscher (ATCC No VR-998) and Moloney Murine Leukemia Virus 
(ATCC No VR-190; PCT Application No WO 94/24298). Particularly preferred Rous Sarcoma 
Viruses include Bryan high titer (ATCC Nos VR-334, VR-657, VR-726, VR-659 and VR-728). 
Other preferred retroviral vectors are those described in Roth J. A. et al. (1996), Nature Medicine. 
2(9):985-991 PCT Application No WO 93/25234, PCT Application No WO 94/ 06920, Roux et 

35 al, 1989, Proc. Natl Acad Sci. USA, 86 : 9079 - 9083, Julan et ai., 1992, J. Gen. Virol, 73 : 
3251 - 3255 Neda et al., 1991, J. Biol. Chem., 266 : 14143 - 14146. 
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Yet another viral vector system that is contemplated by the invention consists of the 
adeno-associated virus (AAV). The adeno-associated virus is a naturally occurring defective 
virus that requires another virus, such as an adenovirus or a herpes virus, as a helper virus for 
efficient replication and a productive life cycle (Muzyczka et al., 1992, Curr. Topics in Micro. 
5 and Immunol., 158 : 97-129). It is also one of the few viruses that may integrate its DNA into 
non-dividing cells, and exhibits a high frequency of stable integration (Flotte et al., 1992, Am. J. 
Respir. Cell Mol BioL, 7:349-356; Samulski et al., 1989, J. Virol, 63 : 3822-3828; 
McLaughlin B.A. et al. (1996), Am. J. Hum. Genet. 59:561-569. One advantageous feature of 
AAV derives from its reduced efficacy for transducing primary cells relative to transformed 
10 cells. 



5) Delivery of the recombinant vectors 

In order to effect expression of the polynucleotides of the invention, these constructs 
must be delivered into a cell. This delivery may be accomplished in vitro, as in laboratory 
1 5 procedures for transforming cell lines, or in vivo or ex vivo, as in the treatment of certain disease 
states. 

One mechanism is viral infection where the expression construct is encapsulated in an 
infectious viral particle. 

Several non-viral methods for the transfer of polynucleotides into cultured mammalian cells are 
20 also contemplated by the present invention, and include, without being limited to, calcium 
phosphate precipitation (Graham et al. (1973), Virology. 52:456-457; Chen et al., 1987, Mol. 
Cell. Biol., 7 : 2745-2752;), DEAE-dextran (Gopal, 1985, Mol. Cell. Biol., 5 : 1188-1190 
electroporation (Tur-Kaspa et al. (1986), Mol. Cell. Biol. 6:716-718; Potter et al., 1984, Proc 
Natl Acad Sci USA. 81(22):7161-5) direct microinjection (Harland et al., 1985, J. Cell. Biol., 
25 101:1094-1095) DNA-loaded liposomes (Nicolau et al., 1982, Biochim. Biophys. Acta, 721:185- 
190; Fraley et al., 1979, Proc. Natl. Acad. Sci. USA, 76 : 3348-3352 and receptor-mediate 
transfection (Wu and Wu, 1987, J. Biol. Chem, 262 : 4429^432; Wu and Wu,1988, 
Biochemistry, 27:887-892). Some of these techniques may be successfully adapted for in vivo or 
ex vivo use. 

30 Once the expression polynucleotide has been delivered into the cell, it may be stably 

integrated into the genome of the recipient cell. This integration may be in the cognate location 
and orientation via homologous recombination (gene replacement) or it may be integrated in a 
random, non specific location (gene augmentation). In yet further embodiments, the nucleic acid 
may be stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid 

35 segments or "episomes" encode sequences sufficient to permit maintenance and replication 
independent of or in synchronization with the host cell cycle. 
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One specific embodiment for a method for delivering a protein or peptide to the interior 
of a cell of a vertebrate in vivo comprises the step of introducing a preparation comprising a 
physiologically acceptable carrier and a naked polynucleotide operatively coding for the 
polypeptide of interest into the interstitial space of a tissue comprising the cell, whereby the 
5 naked polynucleotide is taken up into the interior of the cell and has a physiological effect. This 
is particularly applicable for transfer in vitro but it may be applied to in vivo as well. 

Compositions for use in vitro and in vivo comprising a "naked" polynucleotide are 
described in PCT application No. WO 90/1 1092 (Vical Inc.) and also in PCT application No. 
WO 95/1 1307 (Institut Pasteur, INSERM, Universite d'Ottawa) as well as in the articles of 
10 Tacson et al. (1996) Nature Medicine. 2(8):888-892 and Huygen et al. (1996) Mature Medicine. 
2(8):893-898. 

In still another embodiment of the invention, the transfer of a naked polynucleotide of 
the invention, including a polynucleotide construct of the invention, into cells may be proceeded 
with a particle bombardment (biolistic), said particles being DNA-coated microprojectiles 
1 5 accelerated to a high velocity allowing them to pierce cell membranes and enter cells without 
killing them, such as described by Klein et al. (1987) Nature. 327:70-73. 

In a further embodiment, the polynucleotide of the invention may be entrapped in a 
liposome (Ghosh and Bacchawat, 1991, Targeting of liposomes to hepatocytes, IN : Liver 
Diseases, Targeted diagnosis and therapy using specific receptors and ligands. Wu et al. Eds., 
20 Marcel Dekeker, New York, pp. 87-104; Wong et aL, 1980, Gene, 10 : 87-94; Nicolau C. et al. 
(1987), Methods Enzymol. 149: 157-76). These liposomes may further be targeted to cells 
expressing LSR by incorporating leptin, triglycerides, Acrp30, or other known LSR ligands into 
the liposome membrane. 

In a specific embodiment, the invention provides a composition for the in vivo 
25 production of a leptin polypeptide, or a zinc finger protein, described herein. It comprises a 
naked polynucleotide operatively coding for this polypeptide, in solution in a physiologically 
acceptable carrier, and suitable for introduction into a tissue to cause cells of the tissue to 
express the said polypeptide. 

The amount of vector to be injected to the desired host organism varies according to the 
30 site of injection. As an indicative dose, it will be injected between 0,1 and 100 \ig of the vector 
in an animal body, preferably a mammal body, for example a mouse body. 

In another embodiment of the vector according to the invention, it may be introduced in 
vitro in a host cell, preferably in a host cell previously harvested from the animal to be treated 
and more preferably a somatic cell such as a muscle cell. In a subsequent step, the cell that has 
35 been transformed with the vector coding for the desired leptin polypeptide or the desired 
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fragment thereof is reintroduced into the animal body in order to deliver the recombinant protein 
within the body either locally or systemically. 

V. Recombinant Cells of the Invention 
5 The invention is in part based on the surprising and unexpected discovery that the 

different subunits of LSR interact to form at least two very different receptors : LSR-lep and 
LSR-tg. The LSR-lep receptor requires at least a*. In some embodiments a combination with 
(3 and/or a as well as a* is preferred. The LSR-tg receptor requires a combination of at least a 
and p. In some embodiments a combination with P and/or a as well as a 1 is preferred. Based 

10 on this novel and unexpected finding, it has become critical to engineer cells lacking 

endogenous LSR activity/expression (e.g. as a result of a classical knock-out, chimeraplasty, or 
zinc finger protein inhibition), and then to re-transfect the subunits of interest in various 
combinations and at various levels. This will allow not only the study of these receptors in 
isolation, but also the design of specific inhibitors for the different receptors, and the assessment 

15 of what genes may act to regulate or modulate the receptors, or to transmit the intracellular 
signals from or for each receptor. Although LSR-lep and LSR-tg have been identified, it is 
possible that other LSR receptors with other activities also exist and can be identified by these 
methods. 

Recombinant cells have been designed that are useful in many situations, including : (1) 

20 the study of the role of the various LSR components in isolation and together with and without 
interference from endogenous LSR, (2) as part of an assay system to discover modulators of the 
leptin/LSR interaction, for example, using known components of the LSR system (and in some 
cases no endogenous LSR components; see above), and (3) to produce various polypeptides of 
the invention (see above). To this end, in preferred embodiments, a recombinant cell is 

25 transiently, or preferably stably, transfected with one or more LSR subunits selected from the 
group consisting of a, a' and p. Preferably, the two or more subunits are expressed in pairwise 
ratios to each other of from 1:1 to 1:5. For example, if a and P are present in a cell, cells with 
ratios of 1 :1, 1 :2, 1 :3, 1 :4, 1 :5, 5 :1, 4 :1, 3 :1, 2 :1, as well as 2 :3, 3 :2, 3 :4, 4 :3, 3 :5, 5 :3, 
4 :5, and 5 :4, etc. are preferred. Similar ratios are desired for cells containing a' and p. When 

30 all three subunits are present, cells with all possible combinations of ratios are preferred. These 
are most easily obtained by screening cells (wild-type, transfected, or knockout, for example) 
for their expression levels of the various subunits. Preferably, the one or more LSR components 
are a' and P, and preferably the recombinant cells are cultured PLC cells. However, the cells 
can be selected from any of the cells in the ATCC bank. The LSR polypeptides, the 

35 polynucleotides encoding LSR, and the vectors to transfer the polynucleotides encoding LSR 
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between cells and tissues have been described previously (US National phase application No. 
09/269,939). 

Another object of the invention consists of host cells that have been transformed or 
transfected with one of the polynucleotides described herein, and more precisely a 
5 polynucleotide comprising: a polynucleotide encoding a leptin polypeptide of the invention, or a 
polynucleotide encoding a zinc finger protein of the invention. These polynucleotides can be 
present in the same cell or in a different cell, and can be present in cells transiently or stably 
transfected with any combination of the components of LSR. 

In another embodiment, the invention features cells that lack expression of at least one of 
1 0 the LSR subunits. These can be cells identified by screening processes, but they are preferably 
recombinant cells that have had the gene for LSR knocked-out by traditional techniques well 
known in the art; a cell in which a polynucleotide encoding a zinc finger protein of the invention 
has been transfected that either constitutively suppresses the expression of at least one subunit of 
LSR or whose suppression of LSR can be regulated by the Tet On/Off system, for example; or a 
1 5 cell in which the expression of at least one subunit of LSR has been inhibited as the result of the 
transfection of chimeric oligonucleotides of the invention. 

The invention further features either transiently, or preferably stably, transfecting the 
LSR knockout cells (or zinc finger protein cells) in which expression of at least one, and in some 
cases all, of the endogenous LSR subunits has been inhibited (or eliminated), with at least one, 
20 preferably at least two, and alternatively three, of the LSR subunits and then selecting/screening 
for cells expressing the various ratios of subunits as described above. Preferably, P, a or a' 
alone are transfected, or alternatively a' and P, or a and P together are transfected. 

The invention includes host cells that are transformed (prokaryotic cells) or that are 
transfected (eukaryotic cells) with a recombinant vector such as any one of those described in 
25 « Recombinant Vectors of the Invention ». 

Generally, a recombinant host cell of the invention comprises at least one of the 
polynucleotides or the recombinant vectors of the invention which are described herein, but also 
includes those cells in which the gene for LSR has been knock-out by traditional recombinant 
techniques, zinc finger techniques, or using chimeraplast oligonucleotides. 
30 Preferred host cells used as recipients for the recombinant vectors of the invention are 

the following : 

a) Prokaryotic host cells : Escherichia coli strains (Le. DH5-a strain), Bacillus subtilis, 
Salmonella typhimurium, and strains from species like Pseudomonas, Streptomyces and 
Staphylococcus, and 

35 b) Eukaryotic host cells : HeLa cells (ATCC N°CCL2; N°CCL2.1; N°CCL2.2), Cv 1 

cells (ATCC NTCCL70), COS cells (ATCC N°CRL1650; N°CRL1651), Sf-9 cells (ATCC 
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N°CRL171 1), Cm cells (ATCC N° CRL-1 804), 3T3 (ATCC N° CRL-6361), CHO (ATCC N° 
CCL-61), human kidney 293 (ATCC N° 45504; N° CRL-1573), BHK (ECACC N° 84100501 ; 
N° 841 1 1301), PLC cells, HepG2, Hepa 1-6, and Hep3B. 

The constructs in the host cells can be used in a conventional manner to produce the 
5 gene product encoded by the recombinant sequence. 

Following transformation of a suitable host and growth of the host to an appropriate cell 
density, the selected promoter is induced by appropriate means, such as temperature shift or 
chemical induction, and cells are cultivated for an additional period. 

Cells are typically harvested by centrifugation, disrupted by physical or chemical means, 
1 0 and the resulting crude extract retained for further purification. 

Microbial cells employed in the expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of 
cell lysing agents. Such methods are well known by the skilled artisan. 

Further, according to the invention, these recombinant cells can be created in vitro or in 
1 5 vivo in an animal, preferably a mammal, most preferably selected from the group consisting of 
mice, rats, dogs, pigs, sheep, cattle, and primates, not to include humans. Recombinant cells 
created in vitro can also be later surgically implanted in an animal, for example. Methods to 
create recombinant cells in vivo in animals are well-known in the art, and are specifically meant 
to include the techniques associated with chimeraplasty described herein and known in the art, 
20 whereby the chimeraplast oligonucleotides are provided to the cells in the animal by the use of 
liposomes, preferably liposomes that have targeting molecules for cells containing LSR such as 
LSR binding proteins or ligands, such as apml, Clq, or leptin, for example, in the membrane 
layer. 

25 VIQ. Assays for Identifying Modulators of LSR Activity 

The surprising and unexpected discovery that the different subunits of LSR interact to 
form at least two very different receptors (LSR-lep and LSR-tg) with different activities has 
resulted in the necessity of designing novel assays to identify inhibitors for the different LSR 
receptors. In particular, these assays will preferably utilize the recombinant cells of the 

30 invention, that are engineered to lack endogenous LSR activity/expression (e.g. as a result of a 
classical knock-out, chimeraplasty, or zinc finger protein inhibition). These cells are then re- 
transfected with the subunits of interest in various combinations and at various levels. Preferred 
combinations include those that give rise to the LSR-lep receptor that requires at least a', but 
may also include combination of a' and P, and the LSR-tg receptor that requires a combination 

35 of a and p. Other combinations (and the individual subunits) are also useful to look for other 
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LSR receptor activities and as controls for the activity of compounds (or genes) selected in the 
other assays. 

The invention features methods of screening for one or more compounds that modulate 
LSR activity in cells, that includes providing potential compounds to be tested to the cells, and 
5 where modulation of LSR activity indicates the one or more compounds. In some preferred 
embodiments, the potential compounds are compounds that have been molecularly designed 
based on the identified fragment of leptin that binds and activates LSR as described herein. 

In a preferred embodiment, the invention features a method for selecting a compound 
useful for the treatment or prevention of an obesity-related disease or disorder, comprising: 

10 contacting a recombinant cell that comprises a zinc finger protein of the invention, or a 

recombinant vector comprising any of the zinc finger proteins of the invention with a candidate 
compound; and detecting a result selected from the group consisting of a modulation of an 
activity of the Lipolysis Stimulated Receptor and modulation of expression of the Lipolysis 
Stimulated Receptor; as a means for selecting said compound useful for the treatment or 

15 prevention of said obesity-related disease or disorder. 

In preferred embodiments, said contacting is in the presence of a ligand of said Lipolysis 
Stimulated Receptor. Preferably, said ligand is selected from the group consisting of cytokine, 
lipoprotein, free fatty acid, adipoQ (Apml and Acrp30), and Clq, and more preferably said 
cytokine is leptin. Alternatively, said free fatty acid is oleate. In other preferred embodiments, 

20 said leptin is a leptin polypeptide fragment that modulates the activity of LSR, comprising at 
least 4, but not more than 50 contiguous amino acids of any one of the leptin polypeptide 
sequences set forth in Figure 13, wherein said at least 4 and not more than 50 contiguous amino 
acids comprise the leptin fragment central sequence. In other preferred embodiments, said leptin 
is a variant of a leptin polypeptide fragment that modulates the activity of LSR, consisting of a 

25 22 contiguous amino acid sequence that is at least 75% identical to the leptin fragment variable 
region of any one of the leptin polypeptide sequences set forth in Figure 13. Optionally, the 
leptin fragment is any leptin fragment of the invention described herein. 

In other preferred embodiments of the invention, said activity is selected from the group 
consisting of binding of lipoproteins, uptake of lipoproteins, degradation of lipoproteins, binding 

30 of leptin, uptake of leptin, and degradation of leptin. Preferably, said modulation of LSR activity 
is an increase in said activity, and optionally a decrease in said activity. In other preferred 
embodiments, said expression is on the surface of said cell, and preferably said detecting 
comprises FACS, more preferably said detecting further comprises antibodies that bind 
specifically to said LSR, wherein said LSR comprises an amino acid sequence at least 75% 

35 homologous to at least one of the sequences selected from the group consisting of SEQ ID NO:3, 
SEQ ID NO:5, SEQ ID NO:7, SEQ ED NO:9, SEQ ID NO: 1 1, SEQ ID NO: 17, SEQ ID NO: 18, 
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and SEQ ID NO: 19. In other preferred embodiments, said amino acid sequence is at least 80, 85, 
90, 95, or 99 to 100% homologous to at least one of the sequences selected from the group 
consisting of SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ 
ID NO: 1 7, SEQ ID NO: 1 8, and SEQ ID NO: 1 9. In other preferred embodiments, said antibodies 

5 bind specifically to a region of said LSR selected from the group consisting of an amino 

terminus, a carboxy terminus, a splice site, a cytokine binding site, a fatty acid binding site, a 
clathrin binding site, an apoprotein ligand binding site, a LI/LL motif, a RSRS motif, and a 
hydrophobic region. Preferably, said cell is selected from the group consisting of PLC, CHO- 
Kl, Hep3B, and HepG2, although any cell expressing detectable levels of LSR can be used. 

1 0 Antibodies to LSR and to the various regions of LSR have been extensively described 

previously in US National application 09/269,939, filed May 28, 1999 and its related PCT 
application. In addition, specific antibodies to LSR are described in the Examples (1-8). 

In preferred embodiments, said candidate compound is selected from the group 
consisting of peptides, peptide libraries, non-peptide libraries, peptoids, fatty acids, lipoproteins, 

15 medicaments, antibodies, and small molecules, and optionally can include leptin mimetics 

designed by methods of the invention. The compounds may be active in vitro or in vivo. The 
activity may be increased or decreased ; the compounds may be antagonists or agonists. 
Preferably, said obesity-related diseases and disorders are selected from the group consisting of 
obesity, anorexia, cachexia, cardiac insufficiency, coronary insufficiency, stroke, hypertenison, 

20 atheromatous disease, atherosclerosis, high blood pressure, non-insulin-dependent diabetes, 
hyperlipidemia, and hyperuricemia. The compounds may also modulate body mass. Most 
preferably, the diseases include congenital generalized lipodystrophy. 

In other highly preferred embodiments of the invention, the cells used in the above- 
described assays cells have been modified to express none, or a subset, of the LSR subunits. 

25 The recombinant cells containing zinc finger proteins of the invention are also transfected with 
at least one polynucleotide encoding a LSR polypeptide comprising a sequence at least 75% 
identical to an amino acid sequence selected from the group consisting of .SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO: 7, SEQ ED NO:9, SEQ ID NO:l 1, SEQ ID NO: 17, SEQ ID NO: 18, and 
SEQ ID NO: 19. Preferably the LSR subunit is stably transfected. Preferably the cell is selected 

30 from the group consisting of PLC, CHO-K1, Hep3B, Hepa 1-6, and HepG2. However, other 
cells available from the ATCC, for example, may also be used. In addition, cells with the 
endogenous LSR gene « knocked out » by methods well-known in the are are also expressly 
contemplated (as an option to the use of the zinc finger proteins of the invention, or to the use of 
the chimeraplasts of the invention.). Cells, preferably modified cells, are transfected with one or 

35 more LSR components that may include one, part, or all, of a', a, and P, most preferably a' and 
p. Recombinant cells useful for assays to identify modulators of the leptin-LSR interaction 
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include those described in the « Recombinant Cells of the Invention ». In particular, cells 
expressing a range of ratios of the subunits are desired, including 1:1,1 :2, 1 :3, 1 :4, 1 :5, 5 :1, 
4:1,3:1,2:1, as well as 2 :3, 3 :2, 3 :4, 4 :3, 3 :5, 5 :3, 4 :5, and 5 :4, etc. for a* to (3 or a to P, 
or even a to a', for example. In addition, the various combinations where all three subunits are 
5 present in a cell are also envisioned to be useful in assays for modulators of LSR activity. 

In highly preferred embodiments of the invention, cells with endogenous LSR activity 
knocked-out and transfected with the a' alone, or a' and p LSR subunits together are used to 
screen for modulators of the LSR-leptin interaction. In other preferred embodiments, the a and 
P LSR subunits are used to screen for modulators of triglyceride-rich lipoprotein binding, 
1 0 uptake, and degradation. Cells with all three LSR subunits are useful to screen for modulators 
of the effect of leptin binding uptake and degradation on triglyceride-rich lipoprotein binding, 
uptake and degradation. Similarly, these cells would be useful for screening molecules arising 
from the active leptin fragment molecular modeling described herien. 

15 DC. Methods for Designing Leptin Polypeptide Fragment Mimetics 

Following the discovery of the differential results of human and mouse leptin on human 
and rodent LSR, the region of amino acid sequence sharing the least homology between the two 
homologs was identified and was found to stimulate rodent and human LSR activity 
differentially (Examples 1-8). Identification of the differences between these two highly similar 

20 peptides allows the design of small molecule activators or inhibitors of LSR. Methods of 
determining the differences are well known in the art and include, but are not limited to 
techniques such as molecular dynamic assays, X-ray crystallography, and NMR. Previously, 
these kinds of techniques for creating inhibitors/activators of enzymes have been used 
successfully in the art. Potential small molecule activators/inhibitors designed or identified by 

25 these methods can be tested in the assays described herein. Those that function in these assays 
can then be tested for their effectiveness for treatment of obesity-related disorders and diseases, 
as described herein, for activity in modulating body mass, and for activity in treating congenital 
generalized lipodystrophy (Example 14). 

The invention features a method of designing mimetics of a leptin fragment that 

30 modulates an activity of LSR, comprising: identifying critical interactions between one or 

more amino acids of said leptin fragment and LSR; designing potential mimetics to comprise 
said critical interactions; and testing said potential mimetics ability to modulate said activity 
as a means for designing said mimetics. By « designing mimetics » as used herein is meant 
comparing and combining known molecules to obtain a molecule that is able to mimic some 

35 or all of the activities modulated by leptin, or to preferentially increase or decrease some of 
the activities normally modulated by leptin. These activities include, but are not limited to 
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those activities selected from the group consisting of leptin binding, leptin uptake, leptin 
degradation, triglyceride binding, triglyceride uptake, and triglyceride degradation. The 
methods of comparing and combining use molecular modeling, X-Ray crystallography and 
other techniques well-known in the art to identify the critical interactions. These critical 
5 interactions include, but are not limited to those selected from the group consisting of 
hydrogen bonding, covalent bonding, Van der Waals forces, steric hindrances, and 
hydrophobic interactions. These critical interactions are identified using assays that include, 
but are not limited to, those selected from the group consisting of NMR, X-ray 
crystallography, and computer modeling. Preferably the non-leptin compounds that are 

1 0 identified or designed by these means include, but are not limited to, small molecules 

(molecular weight <500, alternatively between 500 and 1000 MW, or >1,000 MW), peptides, 
peptide libraries, non-peptide molecules, non-peptide libraries and peptoids. 

In preferred embodiments, the leptin fragment to be mimicked consists of the leptin 
fragment variable region of any one of the leptin polypeptide sequences set forth in Figure 13, 

1 5 preferably the human or mouse sequence, most preferably the human sequence. In other 

embodiments, the leptin fragment consists of the leptin fragment central sequence of any one of 
the leptin polypeptide sequences set forth in Figure 13, preferably the human or mouse sequence, 
most preferably the human sequence. 

Methods of studying the structure of enzyme-substrate complexes are well known in the 

20 art. X-Ray crystallography allows the determination of the precise three-dimensional positions 
of most of the atoms in a protein molecule. To do this, a source of x-rays, a protein crystal, and a 
detector are needed. Obtaining the crystal is necessary because the techniques requires that all 
the molecules are precisely positioned. Methods to produce crystals are well-known in the art. 
X-rays going through the protein crystal are scattered by electrons, thus the amplitude of the 

25 wave scattered by an atom is proportional to its number of electrons. The scattered waves then 
recombine, either reinforcing one another on the film or cancelling each other out, depending on 
the atomic arrangement. From this information, the image is formed by applying a mathematical 
relation called a Fourier transform, and from here an electron-density map can be calculated, and 
then interpreted. The limiting resolution for a protein with a good crystal is typically 2 A. 

30 Two methods important for enzyme-ligand interactions include (1) the difference Fourier 

method, and (2) production of stable complexes. In the Fourier method, the enzyme is 
crystallized (in this case LSR) and then the X-ray diffraction of the crystalized protein in solvent 
is compared with the X-ray diffraction of the crystallized protein in the presence of ligand (in 
this case the 22 amino acid leptin peptide). Provided that there are no drastic changes in the 

35 structure or packing of the protein when it binds the ligand, the structure of the complex can be 
solved by comparing the diffrences between the diffraction patterns. This allows the electron 
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density of the bound ligand and minor changes in the protein structure to be obtained without 
starting from scratch. 

Alternatively, the X-ray diffraction pattern of a stably bound complex can be used to 
determine the protein-ligand interactions. Sometimes this is done using an inhibitor of the 
5 ligand, but can also be achieved under unreactive conditions such as : (1) weakly reactive 
conditions due to pH conditions, ionic state, or very low temperature, (2) using a chemically 
modified protein or ligand in which important residues are modified, or (3) under conditions in 
which the equilibrium conditions are shifted. 

X-ray crystallography can be complemented by nuclear magnetic resonance (NMR) 

10 spectroscopy, which can reveal the structure of macromolecules in solution. Certain atomic 
nuclei such as hydrogen are intrinsically magnetic. The spinning of the positively charged 
proton, generates a magnetic moment. This moment can take either of two orientations when an 
external magnetic field is applied. The flow of electrons around a magnetic nucleus generates a 
small local magnetic field that opposes the external field. Under different environments the 

1 5 energy is absorbed at different resonance frequencies, an effect termed a chemical shift. 
Comparison of the shifts and spin-spin couplings, as well as the nuclear Overhauser effect 
(NOESY spectra) leads to the identification of pairs of protons that are less than 5 A apart. 
Overlapping peaks in NOESY spectra can be further resolved by obtaining NMR spectra of 
proteins labelled with 15N and 13C (multidimensional NMR spectroscopy). Typically highly 

20 concentrated solutions of proteins are required (1 mM or 15 mg/ml for a 15 kd protein) and the 
size is generally limited to 30 kd. 

Molecular modelling by computer is also used extensively to augment, supplement and 
integrate the information gained by X-Ray crystallography, NMR, EPR and other techniques. In 
particular, computer programs such as DOCK allow the prediction, identification, and three-D 

25 testing of inhibitors and activators of enzymes. This methodology has been used successfully 
previously to identify inhibitors. Basically, using the information gained from X-ray 
crystallography, NMR, and direct modelling, computer programs can now predict the residues 
that are important for the ligand-protein interactions and can predict structures that can perform 
the same interactions and test compounds proposed to be able to perform the same interactions. 

30 Through this interplay, molecules can be designed and identified to activate LSR in the manner 
of the leptin peptide, or to inhibit this interaction. The advantages to designing a molecule in 
this way include the ability to use compounds that the body cannot metabolize as rapidly as a 
peptide, that are less expensive to make, and that hopefully lack any unwanted leptin-associated 
side-effects. 
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X. Pharmaceutical Compositions of the Invention 

The identified compounds can be administered to a mammal, including a human patient, 
alone or in pharmaceutical compositions where they are mixed with suitable carriers or excipient(s) 
at therapeutically effective doses to treat or ameliorate a variety of disorders associated with lipid 
5 metabolism. A therapeutically effective dose further refers to that amount of the compound 
sufficient to result in amelioration of symptoms of obesity-related diseases or disorders as 
determined by the methods described herein. Thus, a therapeutically effective dosage of a leptin 
polypeptide fragment of the invention, or an antagonist or agonist of the leptin-LSR interaction, or a 
leptin fragment mimetic designed from molecular modeling studies, will be that dosage of the 

1 0 compound that is adequate to promote reduced or increased triglyceride-rich lipoprotein levels 

following a high-fat meal and that will promote weight loss or weight gain with continued periodic 
use or administration. Similarly, a therapeutically effective dosage of a chimeric oligonucleotide of 
the invention or a polynucleotide encoding a zinc finger protein of the invention will be that dosage 
of the compound that is adequate to increase or reduce triglyceride-rich lipoprotein levels 

1 5 following a high-fat meal and that will promote weight loss or weight gain with continued periodic 
use or administration. Techniques for formulation and administration of the compounds of the 
instant application may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., 
Easton, PA, latest edition. 

Additional aspects of the invention feature the use of the compounds, chimeric 

20 oligonucleotides and zinc fingers, described throughout the application as modulators of LSR 
activity in the making of medicaments for the treatment of diseases and disorders described in 
the following section as well as throughout the application. These diseases or disorders include, 
but are not limited to, anorexia, cachexia, AIDS-related weight loss, neoplasia-related weight 
loss, or obesity-related atherosclerosis, obesity-related insulin resistance, obesity-related 

25 hypertension, microangiopathic lesions resulting from obesity-related Type II diabetes, ocular 
lesions caused by microangiopathy in obese individuals with Type II diabetes, and renal lesions 
caused by microangiopathy in obese individuals with Type II diabetes. Modulators of body mass 
are also expressly included, as are compounds (such as the leptin fragments of the invention) for 
treating congenital generalized lipodystrophy. 
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Routes of Administration. 

Suitable routes of administration include oral, rectal, transmucosal, or intestinal 
administration, parenteral delivery, including intramuscular, subcutaneous, intramedullary 
injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal or 
5 intraocular injections. A particularly useful method of administering compounds for promoting 
weight loss involves surgical implantation, for example into the abdominal cavity of the recipient, 
of a device for delivering the compound over an extended period of time. Sustained release 
formulations of the invented medicaments particularly are contemplated. 



10 Composition/Formulation 

Pharmaceutical compositions and medicaments for use in accordance with the present 
invention may be formulated in a conventional manner using one or more physiologically 
acceptable carriers comprising excipients and auxiliaries. Proper formulation is dependent upon the 
route of administration chosen. 

1 5 Certain of the medicaments described herein will include a pharmaceutically acceptable 

carrier and at least one polypeptide that is a leptin polypeptide of the invention. In addition to 
medicaments that include leptin polypeptides of the invention, non-protein compounds designed 
based on molecular modeling of the active leptin polypeptide of the invention also will find utility 
as modulators of LSR activity, both in vitro and in vivo. Further, antagonists and agonists of the 

20 leptin-LSR interaction, including leptin and/or triglyceride-rich lipoprotein binding, uptake and 
degradation will also find utility in modulating LSR activity and/or stimulating a reduction of 
plasma lipoproteins and/or promoting weight loss. 

For injection, the agents of the invention may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 

25 physiological saline buffer such as a phosphate or bicarbonate buffer. For transmucosal 

administration, penetrants appropriate to the barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 

Pharmaceutical preparations that can be taken orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 

30 sorbitol. The push-fit capsules can contain the active ingredients in admixture with fillers such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 
suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. All formulations for oral administration should be in dosages suitable for 

35 such administration. 



WO 01/21647 PCT/IB00/01470 

50 

For buccal administration, the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present invention 
are conveniently delivered in the form of an aerosol spray presentation from pressurized packs or a 
5 nebulizer, with the use of a suitable gaseous propellant, eg., carbon dioxide. In the case of a 
pressurized aerosol the dosage unit may be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of, e.g., gelatin, for use in an inhaler or insufflator, may be 
formulated containing a powder mix of the compound and a suitable powder base such as lactose or 
starch. 

1 0 The compounds may be formulated for parenteral administration by injection, e.g. t by 

bolus injection or continuous infusion. Formulations for injection may be presented in unit dosage 
form, e.g., in ampoules or in multi-dose containers, with an added preservative. The compositions 
may take such forms as suspensions, solutions or emulsions in aqueous vehicles, and may contain 
formulatory agents such as suspending/stabilizing and/or dispersing agents. 

1 5 Pharmaceutical formulations for parenteral administration include aqueous solutions of the 

active compounds in water-soluble form. Aqueous suspensions may contain substances that 
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Optionally, the suspension may also contain suitable stabilizers or agents that increase the 
solubility of the compounds to allow for the preparation of highly concentrated solutions. 

20 Alternatively, the active ingredient may be in powder or lyophilized form for constitution 

with a suitable vehicle, such as sterile pyrogen-free water, before use. 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. Thus, 

25 for example, the compounds may be formulated with suitable polymeric or hydrophobic materials 
(for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble 
derivatives, for example, as a sparingly soluble salt. 

Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. Various 

30 sustained release materials have been established and are well known by those skilled in the art. 
Sustained-release capsules may, depending on their chemical nature, release the compounds for a 
few weeks up to over 100 days. 

Depending on the chemical nature and the biological stability of the therapeutic reagent, 
additional strategies for protein stabilization may be employed. 

35 The pharmaceutical compositions also may comprise suitable solid or gel phase carriers or 

excipients. Examples of such carriers or excipients include but are not limited to calcium 
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carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and polymers 
such as polyethylene glycols. 

Effective Dosage. 

5 Pharmaceutical compositions suitable for use in the present invention include compositions 

wherein the active ingredients are contained in an effective amount to achieve their intended 
purpose. More specifically, a therapeutically effective amount means an amount effective to 
prevent development of or to alleviate the existing symptoms of the subject being treated. 
Determination of the effective amounts is well within the capability of those skilled in the art, 

1 0 especially in light of the detailed disclosure provided herein. 

For any compound used in the method of the invention, the therapeutically effective dose 
can be estimated initially from cell culture assays. For example, a dose can be formulated in animal 
models to achieve a circulating concentration range that includes or encompasses a concentration 
point or range shown to effect enhanced or inhibited LSR activity in an in vitro system. Such 

1 5 information can be used to more accurately determine useful doses in humans. 

A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms in a patient. Toxicity and therapeutic efficacy of such compounds can be 
determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50, (the dose lethal to 50% of the test population) and the ED50 (the dose 

20 therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as the ratio between LD50 and ED50. 
Compounds that exhibit high therapeutic indices are preferred. 

The data obtained from these cell culture assays and animal studies can be used in 
formulating a range of dosage for use in human. The dosage of such compounds lies preferably 

25 within a range of circulating concentrations that include the ED50, with little or no toxicity. The 
dosage may vary within this range depending upon the dosage form employed and the route of 
administration utilized. The exact formulation, route of administration and dosage can be chosen 
by the individual physician in view of the patient's condition. (See, e.g., Fingl et al., 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1). 

30 Dosage amount and interval may be adjusted individually to provide plasma levels of the 

active compound which are sufficient to maintain the LSR modulating effects. Dosages necessary 
to achieve the LSR modulating effect will depend on individual characteristics and route of 
administration. 

Dosage intervals can also be determined using the value for the minimum effective 
35 concentration. Compounds should be administered using a regimen that maintains plasma levels 
above the minimum effective concentration for 10-90% of the time, preferably between 30-90%; 
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and most preferably between 50-90%. In cases of local administration or selective uptake, the 
effective local concentration of the drug may not be related to plasma concentration. 

The amount of composition administered will, of course, be dependent on the subject being 
treated, on the subjects weight, the severity of the affliction, the manner of administration and the 
5 judgment of the prescribing physician. 

A preferred dosage range for the amount of a leptin polypeptide of the invention, or 
compound designed based on its molecular modeling, or an antagonist or agonist of its activity with 
LSR, that can be administered on a daily or regular basis to achieve desired results, including a 
reduction in levels of circulating plasma triglyceride-rich lipoproteins, range from 0.1 - 50 mg/kg 
1 0 body mass. A more preferred dosage range is from 0.2 - 25 mg/kg. A still more preferred dosage 
range is from 1.0-20 mg/kg, while the most preferred range is from 2.0-10 mg/kg. Of course, 
these daily dosages can be delivered or administered in small amounts periodically during the 
course of a day. 

15 XI. Methods of Preventing or Treating Obesitv-related Diseases and Disorders 

A method of preventing or treating obesity-related diseases and disorders comprising 
providing a patient in need of such treatment with a leptin polypeptide fragment or a leptin 
mimetic of the invention. Preferably, the leptin polypeptide fragment or mimetic modulates 
the activity of LSR, more preferably increases the activity of LSR, and optionally decreases 

20 the activity of LSR either in vitro or in vivo. Preferably the leptin polypeptide fragment or 
mimetic is provided to the patient in a pharmaceutical composition that is preferably taken 
orally. Preferably the patient is a mammal, and most preferably a human. In preferred 
embodiments, the obesity-related disease or disorder is selected from the group consisting of 
anorexia, cachexia, AIDS-related weight loss, neoplasia-related weight loss, or obesity-related 

25 atherosclerosis, obesity-related insulin resistance, obesity-related hypertension, microangiopathic 
lesions resulting from obesity-related Type II diabetes, ocular lesions caused by microangiopathy 
in obese individuals with Type II diabetes, and renal lesions caused by microangiopathy in obese 
individuals with Type II diabetes. Modulators of body mass (weight gain or loss) are also 
expressly included, as are compounds (such as the leptin fragments of the invention) for treating 

30 congenital generalized lipodystrophy. 

Alternatively, the invention features a method of preventing or treating obesity-related 
diseases and disorders comprising providing a patient in need of such treatment with a 
compound identified by assays of the invention. Preferably these compounds antagonize or 
agonize the interaction of leptin and LSR. In other embodiments, the compounds are those 

35 created as a result of the molecular modeling of the active leptin polypeptide and are non- 
peptide mimetics that function in the same manner as the active leptin polypeptide of the 
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invention. Preferably, the compound is provided to the patient in a pharmaceutical 
composition that is preferably taken orally. Preferably the patient is a mammal, and most 
preferably a human. In preferred embodiments, the obesity-related disease or disorder is 
selected from the group consisting of anorexia, cachexia, AIDS-related weight loss, neoplasia- 
5 related weight loss, or obesity-related atherosclerosis, obesity-related insulin resistance, obesity- 
related hypertension, microangiopathic lesions resulting from obesity-related Type II diabetes, 
ocular lesions caused by microangiopathy in obese individuals with Type II diabetes, and renal 
lesions caused by microangiopathy in obese individuals with Type II diabetes. Modulators of 
body mass are also expressly included, as are compounds (such as the leptin fragments of the 

1 0 invention) for treating congenital generalized lipodystrophy. 

The invention also features a method for treating or preventing obesity-related diseases 
or disorders involving gene surgery. To this end, it is advantageous in some conditions to either 
express more or less LSR, or alternatively to express more or less of one or more LSR subunits. 
Using the methods described herein, it is possible to modulate the levels of expression of LSR, 

15 or of some LSR subunits using zinc finger polypeptides of the invention or chimeric 

oligonucleotides of the invention. Preferably, the zinc finger polypeptides are provided to an 
individual in need of such treatment by polynucleotides encoding the zinc finger polypeptides of 
the invention. Preferably the zinc finger polynucleotides of the invention are present in a 
recombinant vector, preferably a retroviral vector, more preferably AAV. Preferably the 

20 . chimeric oligonucleotides are provided to a patient in need of such treatment using liposomes. 
Preferably the liposomes are constructed such that molecules targeting the liposomes to cells 
containing LSR are present in the membrane. Preferably the molecules include leptin, apml , and 
Clq, for example. Alternatively they may have compounds that target them to the liver, such as 
glucose, for example, or alternatively to adipose tissue. Preferably the patient is a mammal and 

25 the obesity-related disease or disorder is selected from the group consisting of anorexia, 
cachexia, AIDS-related weight loss, neoplasia-related weight loss, or obesity-related 
atherosclerosis, obesity-related insulin resistance, obesity-related hypertension, microangiopathic 
lesions resulting from obesity-related Type II diabetes, ocular lesions caused by microangiopathy 
in obese individuals with Type II diabetes, and renal lesions caused by microangiopathy in obese 

30 individuals with Type II diabetes. Modulators of body mass are also expressly included, as are 
compounds (such as the leptin fragments of the invention) for treating congenital generalized 
lipodystrophy. 

Still another aspect of the invention relates to the use of chimeric oligonucleotides to 
specifically alter single nucleotide polymorphisms in a patient in need of such treatment. Single 
35 polymorphisms associated with the LSR gene and with obesity have been described in U.S. 
provisional application No. 60/1 19,592, entitled "Polymorphic Markers of the LSR gene" by 
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Blumenfeld et al, filed February 10, 1999, shown in Table A . In one embodiment, this 
medicament can be used for reducing food intake in obese individuals, reducing the levels of free 
fatty acids in obese individuals, decreasing the body weight of obese individuals, or treating an 
obesity related condition selected from the group consisting of obesity-related atherosclerosis, 
5 obesity-related insulin resistance, obesity-related hypertension, microangiopathic lesions 
resulting from obesity-related Type II diabetes, ocular lesions caused by microangiopathy in 
obese individuals with Type II diabetes, and renal lesions caused by microangiopathy in obese 
individuals with Type II diabetes. Modulators of body mass are also expressly included, as are 
compounds (such as the leptin fragments of the invention) for treating congenital generalized 
10 lipodystrophy. 



Table A 



BiaUelic 
Marker 


Marker 
Name 


Localization 
in LoK vjene 


Polymorphism 


Frequency 
yji Aiieie l 


AA 

Change 


Marker Position 


99-14410/373 


A1 


5 'regulatory 
region 


Allele 1:C 






373 of 

^FD ID Nn 2 


99-14424/353 


A2 


5 'regulatory 
region 


Allele 1: A 






353 of 


09- 144 18/322 


A3 


mJ t CtfUlLllUr y 

region 


Allele 1 ■ A 

Allele 2: G 






322 of 

SEQ ID No 4 


99-14417/126 


A4 


5 'regulatory 
region 


Allele 1:C 
Allele 2: T 






126 of 

SEQ ID No 5 


99-14417/334 


A5 


5 'regulatory 
region 


Allele 1:C 
Allele 2: T 






334 of 
SEQ ID No 5 


99-14415/106 


A6 


5 'regulatory 
region 


Allele 1:C 
Allele 2: T 






106 of 

SEQ ID No 6 


99-14413/250 


A7 


5 'regulatory 
region 


Allele 1: A 
Allele 2: C 






250 of 
SEQ ID No 7 


99-14413/383 


A8 


5 'regulatory 
region 


Allele 1:0 
Allele 2: T 






383 of 
SEQ ID No 7 


99-4575/226 


A9 


5 'regulatory 
region 


Allele 1:T 
Allele 2: C 


25% 




226 of 
SEQ ID No 8 


9-19/148 


A10 


5 'regulatory 
region 


Allele 1:C 
Allele 2: T 


15% 




1243 of 
SEQ ID No 1 


9-19/307 


A11 


5 'regulatory 


Allele 1: A 


12% 




1401 of 
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AIIpIp ?- T 

/A11C1C X.. 1 






9FO TD Nn 1 


9-19/442 


A12 


5 'regulatory 
region 


Allele 1:C 






1535 of 


9-20/187 


A13 


5 'regulatory 
region 


Allele 1 : A 
Allele 2: C 






1788 of 
SEQIDNol 


9-1/308 


A14 


Intron I 


Allele 1:C 
Allele 2: G 


24% 




2391 of 
SEQIDNo 1 


9-3/324 


AID 


Exon2 


Allele l : C 
Allele 2: T 


29% 




3778 of 
SEQIDNol; 
595 of SEQID 
Nos 13, 15, and 
17 


99-14419/424 


A16 


Intron 2 


Allele l: C 
Allele 2: A 


22% 




4498 of 
SEQ ID No 1 


9-24/260 


A17 


Intron 3 


Allele 1: A 
Allele 2: G 


35% 




15007 of 
SEQIDNo 1 


9-24/486 


A18 


Intron 4 


Allele 1:0 
Allele 2: A 


15% 




15233 of 
SEQ ID No 1 


9-6/187 


A19 


ExonS 


Allele 1:C 
Allele 2: T 


1% 




15826 of 
SEQ ID No 1; 
940 of 

SEQ ID No 13; 
883 of 

SEQIDNo 15 


9-7/148 


A20 


Intron 5 


Allele 1:0 
Allele 2: A 


35% 




19567 of 
SEQ ID No 1 


9-7/325 


A21 


Exon 6 


Allele 1:G 
Allele 2: A 


14% 


S->N 


19744 of 
SEQIDNol; 
1191 of 
SEQ ID No 13; 

1 I OI 

SEQIDNo 15; 
987 of 

SEQ ID No 17 


9-7/367 


A22 


Intron 6 


Allele 1:A 






19786 of 
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Allele 2: C 






SEQIDNo 1 


9-9/246 


A23 


Exon 8 


Allele 1:C 
Allele 2: G 


0.5% 


P-»R 


20158 of 
SEQIDNol; 
1362 of 
SfcQ ID No 13; 
1305 of 
SEQIDNo 15; 
1158 of 
SEQIDNo 17 


LSRX9-BM 
(17-1/240) 


A24 


Exon 9 


Allele 1: AGG 
Allele 2: Del 
AGG 


Del 26% 


DelR 


20595 of 
SEQIDNol; 
1658 of 
SEQIDNo 13; 
1601 of 
SEQIDNo 15; 
1454 of 
SEQIDNo 17 


LSRX10-BM 


A25 


Exon 10 


Allele 1 : T 
Allele 2 : G 






21108 of 
SEQIDNol; 
2079 of 
SEQIDNo 13; 
2022 of 
SEQIDNo 15; 
1875 of 
bEQ IL> No 1 7 


99-4580/296 


A26 


3 'regulatory 
region 


Allele 1 : A 
Allele a . o 


24% 




296 of 

oby LU ino y 


99-4567/424 


A27 


3 'regulatory 
region 


Allele 1 : C 

Allele L . 1 






424 of 

obi^l lu INO 1U 


99-14420/477 


A28 


3 'regulatory 

region 


Allele 1 : G 

AIlClC Z . 1 






477 of 

oCy LU INO 1 1 


99^582/62 


A29 


3 'regulatory 
region 


Allele 1 : A 
Allele 2 : G 






62 of 

SEQIDNo 12 


99-4582/359 


A30 


3 'regulatory 
region 


Allele 1 : G 
Allele 2 : T 


24% 




359 of 

SEQIDNo 12 
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17-2/297 


A31 


5 ' regulatory 
region 


Allele 1 : C 
Allele2:G 


48% 




818ofSEQE>No 
1 


9-19/256 


A32 


5 ' regulatory 
regio 


Allele 1 : A 
Allele 2 : G 






1374 of SEQ ID 
No 1 



XII: Methods for Selecting Genes that Modulate LSR Expression 

Another aspect of the invention features a method for selecting for genes that modulate 
5 the expression of LSR. This method relies on the use of a retroviral vector to provide cells of 
choice (those that express LSR naturally or recombinantly, and in any combination of subunits 
and subunit levels) with genes of interest at a moderate level. By « a moderate level » is meant a 
level that is intermediary between high and low, as based on the level of expression of GFP. 
Neither high nor low expression is desired since low levels might result in undetectable effects 

10 on LSR activity and high levels might co-opt the use of the cell machinery such that LSR isn't 
made simply for this reason. These moderate levels are easily detected and selected for by 
FACS analysis as described in the Examples. This method also relies on the use of FACS to 
detect changes in the activity of LSR as judged by detecting the expression of LSR, or LSR 
subunits on the surface of the cells, or alternatively intracellularly as well. This can be done by 

1 5 using two antibodies that bind specifically to different regions of LSR, for example the 8 1 B and 
93A antibodies. 

Thus, in a preferred embodiment, the invention features a method of selecting for genes 
that modulate an activity of the Lipolysis Stimulated Receptor, comprising : providing a 
retroviral gene library to cells that express said Lipolysis Stimulated Receptor; contacting said 

20 cells with a ligand of said Lipolysis Stimulated Receptor ; and detecting a change in said activity 
of the Lipolysis Stimulated Receptor as a means for selecting for said genes. Preferably, said 
retroviral gene library comprises a cDNA library from tissues selected from the group consisting 
of liver, brain, muscle, and adipose, and preferably further comprises a detectable marker protein 
selected from the group consisting of GFP, truncated CD2, and truncated CD4. In preferred 

25 embodiments, the method further comprises selecting said cells transfected with the retroviral 
vector for moderate expression of GFP. Preferably, said selecting of cells is by FACS. 

In other preferred embodiments, said ligand is selected from the group consisting of 
cytokine, free fatty acid, lipoprotein, adipoQ (Acrp30, Apml), and Clq, and preferably said 
cytokine is leptin. Preferably said free fatty acid is oleate. More preferably, said leptin is a 

30 leptin polypeptide fragment that modulates the activity of LSR, comprising at least 4, but not 

more than 50 contiguous amino acids of any one of the leptin polypeptide sequences set forth in 
Figure 13, wherein said at least 4 and not more than 50 contiguous amino acids comprise the 
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leptin fragment central sequence. Optionally, said leptin is a variant of a leptin polypeptide 
fragment that modulates the activity of LSR, consisting of a 22 contiguous amino acid sequence 
that is at least 75% identical to the leptin fragment variable region of any one of the leptin 
polypeptide sequences set forth in Figure 13. 
5 In other preferred embodiments, said detecting a change in said activity is by FACS, 

preferably said detecting further comprises fluorescent antibodies that bind specifically to said 
LSR, wherein said LSR comprises an amino acid sequence at least 75% homologous to at least 
one of the sequences selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15, and 

1 0 SEQ ID NO: 16. More preferably, said antibodies bind specifically to a region of said LSR 
selected from the group consisting of an amino terminus, a carboxy terminus, a splice site, a 
cytokine binding site, a fatty acid binding site, a clathrin binding site, an apoprotein ligand 
binding site, a LI/LL motif, a RSRS motif, and a hydrophobic region. 

Antibodies to LSR and to the various regions of LSR have been extensively described 

1 5 previously in US National application 09/269,939, filed May 28, 1999 and its related PCT 
application. In addition, specific antibodies to LSR are described in the Examples (1-8). 

In other preferred embodiments said cell is selected from the group consisting of PLC, 
CHO-K1, Hep3B, and HepG2. In some of these embodiments, said cell has had the endogenous 
LSR activity inhibited by either a traditional « knockout » of the gene encoding LSR, 

20 alternatively said cell has had the expression of endogenous LSR inhibited by transfection of a 
polynucleotide encoding a zinc finger protein of the invention, or by providing a chimeric 
oligonucleotide of the invention to the cell. 

Other characteristics and advantages of the invention are described in the Brief Description 
of the Figures and the Examples. These are meant to be exemplary only, and not to limit the 

25 invention in any way. 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1 A and IB show the effect of leptin on postprandial plasma TG response in 
30 db/db and db^/db^ mice. Overnight-fasted db/db (A), db Pas /db Pas (B) mice were gavage-fed a 
high-fat test meal and immediately injected intravenously (db/db) or intraperitoneal^ 
(db Pa3 /db Pas ) with saline (open symbols) or 50 \ig mouse recombinant leptin (closed symbols). At 
the indicated times, blood was collected from the tail (A) or orbital (B) vein, plasma was 
separated by centrifugation, and plasma TG concentrations were determined using an enzymatic 
35 kit. Each point represents the mean ± SEM (db/db: saline, n = 4, leptin, n = 3; db Pas /db Pas : saline, 
n = 6, leptin, n = 7). The average plasma lipid response in 10 control C57BL6 mice is shown as 
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a dotted line in both A and B. In a separate experiment, shown as an inset for each strain of 
mice, overnight fasted db/db (•) or db Pas /db Pas (■) mice were gavage-fed the test meal and 
immediately injected intravenously with increasing concentrations of leptin. The plasma lipid 
response was then measured as in A and B. The area under the response curve (AUC) was then 
5 calculated using a triangulation method on Microsoft Excel between 0 and 4 hr (mg TG • h / 
mL). Values are presented as % of control value (test meal alone obtained in A or B). Each 
point represents the mean of at least 3 mice. 

Figure 2 shows 125 I-Leptin binding to partially purified rat LSR. Aliquots (72 jig) of partially 
purified rat liver LSR were separated on a 4%-12% SDS-gradient polyacrylamide gel, and 
10 transferred to nitrocellulose as described previously (Yen F.T., Masson M., Clossais-Besnard 
N., 

Andre P., Grosset J.M, Bougueleret L., Dumas J.B., Guerassimenko, O., and Bihain B.E. 
(1999). J Biol Chem 274, 13390-13398) a gel strip stained with Coomassie blue is shown in 
lane 1. The nitrocellulose strips were washed, blocked, and incubated with anti-rat LSR protein 

1 5 antiserum (1 : 1000 dilution)(lane 2), or with 200 ng/mL 125 I-leptin (lane 3). The strips were 

washed and bands were detected as described herein. Image analysis of lane 3 was performed 
on a Phosphorimager (Molecular Dynamics). 

Figures 3A, 3B, 3C, and 3D show the effect of LSR subunit transfection on leptin 
binding and degradation in CHO-K1 cells. For Fig. 3 A, CHO-K1 cells were transfected with 

20 increasing concentrations of the a, (□), a'( B ) or P (O) LSR plasmid, or vector alone (•) using 
Fugene transfection reagent. After 48 h, the cells were washed once in PBS and incubated at 37 
°C for 2 h with 10 ng/mL of ,25 I-leptin in DMEM containing 0.2% (w/v) BSA, 2 mM CaCl 2 and 
5 mM HEPES, pH 7.4 (Buffer A). The monolayers were washed and lysed with 0.1 N NaOH 
containing 0.24 mM EDTA, and the lysates were counted. The results are shown as the amount 

25 of cell-associated 125 I-leptin. For Fig. 3B, lysates were prepared from CHO-K1 wild type, stable 
transfectants of vector or LSR a' subunit, and PLC, and separated on a 10% SDS- 
polyacrylamide gel under denaturing and reduced conditions. After transfer to nitrocellulose, 
Western blots were performed using anti-LSR 170 antibody (can also be done with the human 
equivalent, 93 A). Northern blots were done to detect LSR mRNA in CHO-K1 wild-type versus 

30 PLC. RT-PCR analysis was also done in CHO-K1 as compared to PLC. For Fig. 3C and 3D, 
confluent monolayers of stable-transfected cell lines expressing LSR a' subunit (■) or vector 
alone (•) were washed once in PBS and incubated at 37 °C for 2 h with increasing 
concentrations of l25 I-leptin in Buffer A. The amount of cell-associated (Fig. 3C) and degraded 
(Fig. 3D) ,25 I-leptin was then measured as described herein. Results are shown as the mean of 

35 triplicate determinations. 
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Figures 4A, 4B, 4C, and 4D show LSR binding and degradation of l25 I-leptin in human 
hepatocytes, and the effect of 81B anti-LSR antibody. For Fig.4A, PLC cells were lysed (3-T175 
cm 2 flasks per condition) and immunoprecipitated with irrelevant or 8 IB anti-serum. The 
immunoprecipitates were washed, were separated on 10% SDS-polyacrylamide gels under 
5 nondenaturing conditions, and were transferred to nitrocellulose. Ligand blots using ,25 I-leptin 
were then performed as described in Figure 2. For Fig. 4B, confluent monolayers of PLC cells 
were incubated at 37 °C for 30 min with 100 nM insulin, were washed, and then were incubated 
for 30 min at room temperature in the presence of anti-LSR peptide 8 IB antibody (■), or 
irrelevant (□) IgG. After this, the cells were incubated at 37 °C for 2 h with increasing 

10 concentrations of !25 I-leptin in Buffer A. The monolayers were washed, and the amount of 125 I- 
leptin degraded was determined as described herein. Results are shown as the mean of duplicate 
(irrelevant IgG) or triplicate (anti-LSR peptide IgG) determinations. Fig. 4C is a schematic 
diagram of the motifs found from the predicted protein sequence of LSR a cDNA. A 
corresponding Kyte-Doolittle hydrophilicity plot (Lasergene, DNAstar, Madison, WI) is shown 

1 5 underneath. For Fig. 4D, PLC cell aliquots were prepared and incubated with irrelevant, 93 A or 
8 IB antibodies. After washing and incubation with goat-anti-rabbit FITC-conjugated antibody, 
the cells were fixed and analyzed by flow cytometry using a FACSCalibur (Becton Dickinson). 

Figures 5 A, 5B, 5C, and 5D show the stimulatory effect of leptin on LSR activity in PLC 
and suppression of this effect by 8 IB antibody. For Fig. 5 A, 5B, and 5C, cultured PLC cells 

20 were incubated at 37 °C for 30 min with increasing concentrations of human recombinant leptin 
in Buffer A. After this, 0.5 mM oleate and 20 ng/mL 125 I-LDL were added, and cells were 
further incubated at 37 °C for 2 h. Cells were washed, and the amount of oleate-induced l25 I- 
LDL bound (5A), internalized (5B) and degraded (5C) were measured. For Fig. 5D, PLC cells 
were incubated at room temperature for 30 min with 200 ng/mL anti-LSR peptide 81 B or 170 

25 antibody, followed by incubation at 37 °C for 30 min without (open bar) or with (hatched bar) 10 
ng/mL human leptin. Oleate (0.5 mM) and 125 I-LDL (20 ng/mL) were added, and the 
monolayers were left at 37 °C for 3 h. After washing, the amount of 125 I-LDL binding was 
determined, and is shown here as the mean ± SD of triplicate determinations. 

Figures 6A, 6B, and 6C show the effect of leptin on 125 I-LDL and ,25 I-chylomicron 

30 binding to LSR in primary cultures of rat hepatocytes. For Fig. 6 A, primary cultures of rat 
hepatocytes (48 hours after plating) were incubated at 37 °C for 30 min in the absence (□) or 
presence (■) of 50 ng/mL leptin in Buffer A, followed by a 20 min incubation at 37 °C with 0.5 
mM oleate. The cells were then washed with ice-cold PBS, precooled for 10 min, and then 
incubated for 1 h at 4 °C with increasing concentrations of ,25 I-LDL in Buffer A. Cells were 

35 washed, were lysed in 0.1 N NaOH and were counted for radioactivity. Results are shown as the 
mean of duplicate determinations. For Fig. 6B, primary cultures of rat hepatocytes were 
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incubated at 37 °C for 30 min with or without 20 ng/mL leptin followed by incubation at 37 °C 
for 4 h with 6 \ig protein/mL I25 I-chylomicrons in the absence or presence of 0.5 mM oleate in 
Buffer A. The cells were then washed and the !25 I-chylomicrons bound to the cell surface were 
released into the media by incubation with 10 mM suramin. The media was recovered and the 
5 radioactivity was measured. Results are shown as the mean ± SD of six determinations. Fir 

Figure 6C, after incubation at 37 °C for 30 min with 50 ng/mL leptin, the cells were incubated at 
room temperature for 30 min with 200 \ig IgG/mL antibodies directed against rat LSR protein or 
irrelevant IgG. The amount of 125 I-chylomicrons bound was determined, and results are shown as 
means ± SD of triplicate (irrelevant) or quadruplicate (anti-LSR) determinations. 

1 0 Figures 7 A and 7B show a comparison of the effect of human and mouse leptin on LSR 

activity in rat hepatocytes and on postprandial increase in plasma TG in db^ /db Pas mice. For 
Fig. 7A, primary cultures of rat hepatocytes were incubated 30 min at 37 °C without (open bar) 
or with 10 ng/mL recombinant human (solid bar) or mouse (hatched bar) leptin in Buffer A. 
Oleate (0.5 mM) and l25 I-LDL (20 ng/mL) were added and the cells were incubated 2 h at 37 °C. 

1 5 The media were removed and were analyzed for TCA-soluble degradation products. The mean 
of duplicate determinations is shown. For Fig. 7B, db Pas /db Pas mice were given a test meal as 
previously described, followed immediately by injection up. of saline (open bar, n = 4), human 
leptin (1 ng/animal; solid bar, n = 3) or mouse leptin (0.25 ng/animal; hatched bar; n = 3). The 
data represent the difference in TG concentrations measured at t=0 and the average of the 

20 concentrations at 3 and 4 hours. Results are shown as mean ± SEM. 

Figures 8 A and 8B show the effect of mouse or human leptin on LSR activity in primary cultures 
of rat hepatocytes or a human liver cell line (PLC). Primary cultured rat hepatocytes were 
obtained commercially (In Vitro Tech). The PLC cell line was obtained from ATCC repository 
and maintained in culture. Rat hepatocytes 72 h after plating (8A) or confluent monolayers of 

25 PLC (8B) were incubated 30 min at 37 °C with 0 (closed bar) or 10 ng/mL of human (open bar) 
or mouse (hatched bar) recombinant leptin. Following this, 0.5 mM oleate and 20 fig/mL 125 I- 
LDL were added and the cells were further incubated for 2 h at 37 °C. The cells were washed, 
and the amount of oleate-induced l25 I-LDL bound, internalized and degraded was measured as 
described previously (Bihain, B.E., and Yen, F.T. (1992). Free fatty acids activate a high-affinity 

30 saturable pathway for degradation of low-density lipoproteins in fibroblasts from a subject 

homozygous for familial hypercholesterolemia. Biochemistry 37, 4628^4636.. Results here are 
shown as % of control values obtained in the absence of leptin. Figure 9 shows the effect of 
mouse or human leptin peptide on LSR activity in PLC. Confluent PLC monolayers were 
incubated 30 min at 37 °C with increasing concentrations of mouse (■ ) or human (•) leptin 

35 peptide. Following this, 0.5 mM oleate and 20 jig/mL 125 I-LDL were added and the cells were 
further incubated for 2 h at 37 °C. The cells were washed, and the amount of oleate-induced 
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,25 I-LDL bound and degraded was measured as described previously (Bihain, B.E., and Yen, F.T. 
(1992). Free fatty acids activate a high-affinity saturable pathway for degradation of 
low density lipoproteins infibroblasts from a subject homozygous for familial 
hypercholesterolemia. Biochemistry 31, 4628-4636. 
5 Figure 10 shows the effect of mouse or human leptin peptide on LSR activity in primary cultured 
rat hepatocytes. Cells were incubated 30 min at 37 °C with increasing concentrations of mouse 
(■ ) or human (•) leptin peptide. Following this, 0.5 mM oleate and 20 ng/mL ,25 I-LDL were 
added and the cells were further incubated for 2 h at 37 °C. The cells were washed, and the 
amount of oleate-induced ,25 I-LDL bound and degraded was measured as described previously 
10 (Bihain, B.E., and Yen, F.T. (1992). Free fatty acids activate a high-affinity saturable pathway 

for degradation of low-density lipoproteins in fibroblasts from a subject homozygous for 
familial 

hypercholesterolemia. Biochemistry 31, 4628-4636). 

Figures 1 1 A and 1 IB show the effect of mouse leptin (Fig. 1 1 A) or leptin peptide (Fig. 

15 1 IB) on postprandial plasma TG response in ob/ob mice. A single dose of 50 ng mouse leptin 
(A, ■ ), leptin peptide (B, ■ ), or a comparable volume of saline (□) was injected 
subcutaneously at t = 0 h (8:30 AM) directly following gavage of a high fat meal (0.5 mL). 
Postprandial triglyceridemia was measured as described previously. Each point represents the 
mean ± SEM (A: saline, n = 8, leptin, n = 7; leptin peptide: saline, n = 8, leptin, n = 8). The 

20 insert to figure 1 IB shows the effect of mouse leptin on LSR activity in primary cultures of rat 
hepatocytes. Primary cultures of rat hepatocytes were incubated 30 min at 37 °C with increasing 
concentrations of mouse leptin peptide. Oleate (0.5 mM) and I25 I-LDL (20 ng/mL) were added 
and the cells were incubated 2 h at 37 °C. After washing, the amount of 1251-LDL bound to the 
cell surface was measured as described in the Examples section herein. 

25 Figure 1 2 contains a Table that presents results showing the effect of test meal with and 

without leptin injection on postheparin lipolytic activity in db Pa 7db Pas mice. Animals were 
gavaged with the test-meal and injected at the same time with 50 \xg leptin or physiological 
saline as described previously (t = 0 h). After I h, the mice were injected with heparin and blood 
samples were taken at the peak of postprandial triglyceridemia (t = 2 h). Lipase activity was 

30 measured in the postheparin plasma as described in the Examples section herein,, and is reported 
here as the mean ± SEM (n = 3 animals for each condition; ns = not significant). 

Figure 13 shows a multiple sequence alignment of leptin polypeptides from various 
species including: Homo sapiens (SEQ ID NO:32), Mus musculus (SEQ ID NO:34), Rattus 
norvegicus (SEQ ED NO:38), Sus scrofa (SEQ ID NO:39), Bos Taurus (SEQ ID NO:28), Gallus 

35 gallus (SEQ ID NO:30), Ovus aries (SEQ ID NO:35), Canis familiaris (SEQ ID NO:29), Gorilla 
gorilla gorilla (SEQ ID NO:3 1), Macaca mulatto (SEQ ID NO:33), Pan troglodytes (SEQ ID 
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NO:36), and Pongo pygmaeus (SEQ ID NO:37). Divergent residues (from the consensus 
sequence) are boxed. The 22 amino acid region of the exemplary active leptin peptide is shaded 
for all species in the alignment. Residues 10-13 of the shaded region make up the "leptin 
fragment central sequence". 
5 Figure 14 shows a schematic diagram of an exemplary retroviral vector. The vector 

pMX-IRES-GFP contains the murine Moloney virus LTR and a packaging signal (Onishi et aL 
Exp. Hematol. 24: 324-329 (1996)); the EMCV IRES is placed between the polylinker/stuffer 
and a cDNA encoding a selectable marker protein. Three exemplary marker proteins are GFP, 
murine CD2 and human CD4. The IRES sequence is indicated as a shaded box with an arrow 

1 0 indicating the direction of translation. The segment containing the bacterial origin of replication 
and ampicillin resistant gene is indicated by a black box. The stippled box represents sequence 
encoding the green fluorescent protein; alternatively it can contain the truncated CD2 or CD4 
sequences. Open boxes with arrows indicate the viral LTR sequences. The open box indicates a 
stuffer fragment containing multiple cloning sites. 

1 5 Figure 15 shows a schematic of a plan to create truncated LSR constructs. 

Figures 16A, 16B, and 16C show that the transfection of a truncated form of LSR (DN5 
+ 6) increases 125 I-LDL binding (A), uptake (B) and degradation (C) in PLC cells in reference to 
protein concentration. All points are done in triplicate. 

Figures 17A, 17B, and 17C show that the transfection of a truncated form of LSR 

20 (DN5+6) increases 125 I-LDL binding (A), uptake (B) and degradation (C) in PLC cells correcting 
for transfection efficiency using p-gal as a reference. All points are done in triplicate. 

Figures 18A and 18B show graphs of the total LSR expression in mouse liver determined 
by Quantitative PGR. 

Figures 19A and 19B show graphs of the expression of LSR isotypes in mouse liver. 

25 Figures 20A and 20 B show graphs of the relative levels of LSR isotype expression in 

mouse liver. 

Figure 21 shows a graph of total LSR expression in mouse brain determined by 
Quantitative PCR. 

Figure 22 shows a graph of the expression of LSR isotypes in mouse brain. 
30 Figure 23 shows a graph of the relative levels of LSR isotype expression in mouse brain. 

Figures 24 A, 24B, 24C, 24D, and 24E show the difference in LSR expression and 
activity in 2 cultured human hepatocyte cell lines. Figures 24A and 24B show graphs of LSR 
mRNA levels (24A) and cell surface expression (24B) in PLC (GG) and HepG2 (AG) cells by 
quantitative PCR and FACS, respectively. Figures 24C, 24D, and 24E show graphs of the 
35 oleate-induced 125 I-LDL bound (A), internalized (B), and degraded (C) in confluent monolayers 
of PLC (■) and HepG2 (71) that were incubated 3 h at 37 °C with the indicated concentrations of 
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oleate and 20 jig/mL ,25 I-LDL. The cells were then washed and the amounts of 125 I-LDL bound, 
internalized and degraded were measured as described previously. 

Figure 25 shows a table of the characteristics of recombinant ZFPs directed toward LSR 
sequences. The first column is the identification number of the plasmid expressing a specifically 
5 engineered ZFP. The ZFP column represents different zinc finger "cassettes" designed to 
recognize the 9 bp regions of the target sequence. These "cassettes" have then been linked 
together (see WO 98/5431 1) to create the ZFP for the final 18 bp target sequence listed in the 
final column. Sangamo determined the data on the fold activation and binding constant. The 
target sequences are located 5' to the translation start site in the mouse LSR gene sequence. 

10 Figures 26A, 26B, 26C, 26D, 26E, 26F, 26G, 26H, 261, 26J, 26K, 26L, 26M, 26N, 260, 

26P, 26Q, 26R, 26S, and 26T show schematics and nucleotide sequence of the LSR zinc finger 
plasmids pSBS5 1 82-NVF (26A), pSBS5 1 83-NVF (26B), pSBS5 1 85-NVF (26C), pSBS5 1 86- 
NVF (26D), and pSBS5205-NVF (26E). The locations of the ampicillin gene (Amp), neomycin 
gene (Neo) CMV promoter NLS, ZFP, VP 16, FLAG, bGHpA as well as various restriction sites 

15 are shown in the schematics. 

Figure 27 shows a Northern Analysis of LSR zinc finger mRNA expression. Numbers 
are shown as percent of control plasmid. Only the results from 48 hrs are shown. 

Figure 28 shows a more detailed Northern analysis of LSR zinc finger mRNA 
expression. Numbers are shown as percent of control plasmid. Only the results from 48hrs are 

20 shown. 

Figure 29 shows a quantitative PCR Analysis of Hepal-6 cells transfected with ZFP- 
NVF constructs. 

Figures 30A, 30B, 30C, 30D, 30E, and 30F show binding, uptake and degradation 
(BUD) data from ZFPs. The following ZFP's were examined: 5 1 85-NVF, 5 1 86-NVF, and 
25 control plasmid VegF-NVF (a non related ZFP). Results are corrected for total protein in A-C 
and for P-gal in D-F. 

Figure 31 shows a diagram of the coculture system. Endothelial cells are plated in the 
upper compartment on the filter and astrocytes in the lower compartment on the plastic of the 
Petri dish. 

30 Figure 32 shows a diagram of transcytosis and permeability studies. 

Figure 33 shows a graph of leptin transcytosis in BBB in vitro model. Cells were 
incubated with l25 I-leptin alone (10,000 dpm/ng)(closed squares), with 1 jig/mL unlabelled leptin 
(triangles), 50 ng/mL MP (circles), 50 ng/mL HP (open squares), or 2 mg/mL lactoferrin 
(asterisks). 

35 Figure 34A and 34B show graphs of the effect of leptin, MP, HP, and lactoferrin on the 

permeability of the EC monolayer. Sucrose (34A) and inulin (34B) permeability studies were 
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performed in the absence (diamonds) or presence of 10 ng/mL leptin (squares), 5 ng/mL leptin 
(triangles), 10 ng/mL leptin (crosses). The effect of peptides were also tested by the addition of 
10 ng/mL leptin + 50 ng/mL mouse peptide (MP, circles) or 10 ng/mL leptin+50 jig/mL human 
peptide (HP, open squares) or 10 ng/mL leptin+2 mg/mL lactoferrin (lacto, asterisks). 
5 Figures 35A and 35B show graphs of LSR activity and mRNA expression measured in 

PLC cells preincubated 24 h with leptin. In Figure 35A, PLC monolayers were incubated 24 h at 
37 °C with (o) or without (n) 200 ng/mL human recombinant leptin. After washing with PBS, 
cells were incubated 30 min at 37 °C with increasing concentrations of human leptin, followed 
by a 2 h incubation at 37 °C with 0.8 mM oleate and 20 jig/mL l25 I-LDL. Cells were washed, 

1 0 and the amount of oleate-induced ,25 I-LDL binding was measured as described previously. 
Results are shown as the mean of triplicate determinations. In Figure 35B, PLC monolayers 
were incubated 24 h at 37 °C with 0, 200, or 400 ng/mL human recombinant leptin. After 
washing with PBS, the cells were harvested. Total RNA was prepared from the cell pellets, and 
Northerns were performed to detect LSR mRNA, using GAPDH probe as loading control as 

1 5 described previously. Northern blots were scanned on the Phosphorimager (Molecular 

Dynamics, Sunnyvale, CA). Densitometric analysis of the images was performed using the 
software ImageQuant. Results are shown as the amount of LSR signal relative to that of 
GAPDH (mean ± SD, n = 3/condition). 



20 EXAMPLES 

The following Examples are provided for illustrative purposes and not as a means of 
limitation. One of ordinary skill in the art would be able to design equivalent assays and 
methods based on the disclosure herein all of which form part of the instant invention. 



25 GENERAL MATERIALS and METHODS 
Materials 

125 

Na I was purchased from Amersham-Pharmacia (Piscataway, NJ; Les Ulis, France). 
Oleic acid, bovine serum albumin (A2153)(BSA), were obtained from Sigma (St. Louis, MO; St. 
Quentin Fallavier, France). Sodium heparin was purchased from Choay laboratories (Gentilly, 

30 France). Fugene was purchased from Roche Boehringer Mannheim (Indianapolis, IN), and 
Superfect from Qiagen (Valencia, CA). Zeocin was obtained from Invitrogen (Carlsbad, CA). 
Suramin was a gift from Bayer Pharmaceuticals (Puteaux, France). Enzymatic kits for the 
determination of TG and FFA were obtained from Roche-Boehringer Mannheim (Meylan, 
France) and WAKO (Richmond, VA; Unipath, Dardilly, France), respectively. Dulbecco's 

35 modified Eagle's medium (DMEM), trypsin, penicillin-streptomycin, glutamine, and fetal bovine 
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serum (FBS) were purchased from Life Technologies, Inc (Grand Island, NY; Eragny, France). 
RIA kits for plasma leptin measurements were obtained from Linco (St. Louis, MO). 
Experiments in Figures 1 (db/db only), 2 and 6 were performed using recombinant mouse leptin 
prepared in the laboratory as described previously (Yen F.T., Masson M, Clossais-Besnard N., 
5 Andre P., Grosset J.M., Bougueleret L., Dumas J.B., Guerassimenko, O., and Bihain B.E. 
(1999). JBiolChem 274, 13390-13398). 

The remainder of the experiments were performed using commercial preparations of 
recombinant human or mouse leptin (Sigma and Calbiochem, Meudon, France). <x 2 - 
1 0 Macroglobulin-methylamine was a kind gift from Dr. D. Strickland (American Red Cross, 
Rockville, MD). 

Animals 

Male wild-type and C57BL/Ks db/db (db) mice were purchased from R. Janvier 
1 5 Breeding Center (Le Genest St. Isle, France), while male db Pas /db Pa * were kindly made available 
by Prof. J.L. Guenet (Institut Pasteur, Paris, France). Female ob/ob mice were obtained from 
The Jackson Laboratory (Bar Harbor, Maine ). All animals were housed in an animal facility on 
a 12 h light/dark cycle and were allowed water and rodent chow (No. 1 13, UAR, Epinay-sur- 
Orge, France) ad libitum. Mean body weights at the time of the experiment for wild-type, db/db, 
20 db Pa */db Pa \ and ob/ob mice were 27.8 ± 1 .4, 33.8 ± 9, 74.6 ± 1 1 .4 g, and 49.4 ± 5.49 g, 

respectively. The research protocol was in accordance with French Ministry of Agriculture, 
section of Health and Animal Protection and the established institutional guidelines. 

Cells 

25 Primary cultures of rat hepatocytes were prepared as described previously (Yen, F.T., 

Mann, C.J., Guermani, L.M., Hannouche, N.F., Hubert, N., Hornick, C.A., Bordeau, V.N., 
Agnani, G., and Bihain, B.E. (1994). Biochemistry 35, 1 172-1 180). using overnight-fasted 150- 
200 g Sprague-Dawley male rats (R. Janvier Breeding Center) or obtained commercially (InVitro 
Technologies, Baltimore, MD). Cells were used in experiments 48 h after plating. The PLC 

30 liver hepatoma (CRL-8024) and Chinese hamster ovary (CHO-K 1 , CRL 961 8) cell lines were 
obtained from the ATCC repository (CRL-8024; Manasass, VA). The PLC line was maintained 
in tissue culture with MEM containing 10% (v/v) FBS, 2 mM glutamine, sodium pyruvate, non- 
essential amino acids, 100 units/mL penicillin, and 100 units/mL streptomycin. CHO-K 1 cells 
were grown in Ham's-F12 containing 10% (v/v) FBS, 2 mM glutamine and 100 units/mL each of 

35 penicillin and streptomycin. 
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Anti-LSR antibodies and peptides 

The preparation of antibodies directed against rat LSR protein, and anti-LSR peptide 170 
antibodies was as described previously (Yen F.T., Masson M, Clossais-Besnard N., Andre P., 
Grosset J.M., Bougueleret L., Dumas J.B., Guerassimenko, O., and Bihain B.E. (1999). J Biol 
5 Chem 274, 13390-13398. Synthetic peptides 81B and 93A with sequences corresponding to 
human LSR a residues 35-45 (FGRDARARRAQ) and 613-627 (EEAYYPPAPPPYSET), 
respectively, were obtained commercially. Polyclonal antibodies directed against this synthetic 
peptide conjugated to KLH were prepared, and the IgG fraction was purified as described 
previously (Yen F.T., Masson M., Clossais-Besnard N., Andre P., Grosset J.M., Bougueleret L. 
10 Dumas J.B., Guerassimenko, O., and Bihain B.E. (1999). J Biol Chem 274, 13390-13398.) 
Synthetic peptides corresponding to residues 1 17-138 of mouse leptin 
(CSLPQTSGLQKPESLDGVLEAS) as well as the corresponding fragment of human leptin 
were commercially prepared (Research Genetics, Huntsville, AL). 



15 In Vivo Methods 

Measurement of plasma lipid response in mice 

Mice that were fasted for 2-3 hours were gavage-fed 300 \iL of a test meal consisting of 
60% fat (37% saturated, 27% mono-, and 36% polyunsaturated fatty acids), 20% protein and 
20% carbohydrate, and providing 56 kcal of energy/kg (1.5 g butter, 1.5 g sunflower oil, 2.5 g 

20 nonfat dry milk, 2.5 g sucrose and 3 ml water). Immediately after the meal, the animals were 

injected intravenously {db/db) or intraperitoneally (db Pns /db Pas ) with either 200 ^iL physiological 
saline or 200 \xL of the same solution containing recombinant mouse leptin. At selected time 
intervals, 20 \ih of blood were collected from the orbital (db Pas /db Pas ) or tail (db/db) vein into 
ice-cold microfuge tubes containing 4 mmol/L EDTA. Plasma was obtained by centrifugation at 

25 2500 rpm for 20 min at 4 °C, and was frozen as aliquots at —80 °C before analysis. TG 

concentrations were determined using a commercially available enzymatic kit with controls 
included in each assay (Precinorm L, Roche-Boehringer Mannheim; Lyotrol N, BioMerieux). 

Measurement of postheparin lipolytic activity 

30 Mice were gavage-fed and injected with leptin or control solutions as described above. 

At t = 1 h, the mice were injected subcutaneously with heparin (100 IU/kg body weight). At t = 
2 h, the animals were bled and the plasma was immediately separated by centrifugation. Lipase 
activity was determined according to Iverius and Brunzell (1985) using 20% Lipoven (Fresenius 
France Pharma, Louviers, France) as the source of TG. The assay was performed using 25 ^iL 

35 postheparin plasma in 0.15 M NaCl (200 \iL total volume), and in the presence of 10 nL heat- 
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inactivated (56 °C, 30 min) human plasma as a source of apoC's. Before and at the end of the 
incubation, FFA concentrations were determined using an enzymatic kit. 

Cell Culture Studies 
5 Lipoprotein receptor studies 

LSR activity was measured as the oleate-induced binding, uptake, and degradation of 125 I-low 
density lipoprotein (LDL) in cells following the method described in detail previously ( , 
B.E., and Yen, F.T. (1992). Free fatty acids activate a high-affinity saturable pathway for 
degradation of low-density lipoproteins in fibroblasts from a subject homozygous for familial 

10 hypercholesterolemia. Biochemistry 31, 4628-4636; Yen, F.T., Mann, C.J., Guermani, L.M. 
Hannouche, N.F., Hubert, N., Hornick, C.A., Bordeau, V.N., Agnani, G., and Bihain, B.E 
(1994). Biochemistry 33, 1 172-1 180) ; Yen F.T., Masson M, Clossais-Besnard N., Andre P., 
Grosset J.M., Bougueleret L., Dumas J.B., Guerassimenko, O., and Bihain B.E. (1999). J Biol 
Chem 274, 13390-13398). Modifications of the standard protocols are described in the Brief 

1 5 Description of the Drawings. 

Identification Of LSR Protein 
Western blotting 

Confluent monolayers of cells were washed in PBS, and lysed in 20 mM Tris containing 
20 2 mM EDTA and 0.5 % (w/v) SDS and an protease inhibitors (0.1 mg/mL PMSF, 2 ng/mL 

leupeptin and 1 .9 ng/mL aprotinin). The lysate was then separated on 10% SDS-PAGE under 
denaturing conditions. After transfer to nitrocellulose, the strips were probed with anti-LSR 
peptide anti-serum. Bands were revealed after incubations with secondary goat anti-rabbit IgG 
conjugated to alkaline phosphatase. After washing in PBS containing 0.5% (v/v) Tween 20, the 
25 bands were revealed by incubation with substrate. 

Immunoprecipitation 

Confluent monolayers of PLC cells were lysed in PBS containing 1% (w/v) Triton X-100, and 
then were incubated with the specified anti-LSR antibodies, as described previously (Yen F.T., 
Masson M., Clossais-Besnard N., Andre P., Grosset J.M., Bougueleret L., Dumas J.B., 
30 Guerassimenko, O., and Bihain B.E. (1999). J Biol Chem 274, 13390-13398). 

Immunoprecipitates were separated on 10% SDS-polyacrylamide gels under nondenaturing 
conditions, and then transferred onto nitrocellulose. 



Ligand blotting 

Partially purified rat LSR (240 kDa band complex) was obtained as described previously 
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(Yen F.T., Masson M, CIossais-Besnard N., Andre P., Grosset J.M., Bougueleret L., Yen FT., 
Masson M., CIossais-Besnard N., Andre P., Grosset J.M., Bougueleret L., Dumas J.B., 
Guerassimenko, (X, and Bihain B.E. (1999). J Biol Chem 274,13390-13398.) The band was 
separated on non-denaturing 4-12% gradient SDS polyacrylamide gel, and was transferred to 
5 nitrocellulose by semi-dry transfer (Biorad, 18 V, 25 min). The nitrocellulose strip was 

incubated at room temperature with PBS containing 3% BSA, and then incubated at 37 °C for 1 
h 

with 200 ng/mL 1251-leptin in PBS containing 0.2% BSA, pH 7.4. After six 10 min washes in 
PBS containing 0.5% TritonX-100, the strip was air-dried and exposed on a phospohor screen 
10 for 
analysis. 

Preparation of lipoproteins 

Human LDL (1.025 < d < 1.055 g/mL) were isolated by sequential ultracentrifugation of 
fresh plasma obtained from the local blood bank (Havel, R., and Kane, J.P. (1995). In The 
Metabolic and Molecular Basis of Inherited Disease, vol. n, Scriver, C.R., Beaudet, A.L., 
Sly, W.S., and Valle, D., eds. (New York, NY: McGraw-Hill, Inc), pp. 1841-1851. 

Rat chylomicrons were prepared from overnight-fasted male Sprague-Dawley rats (300- 
400 g) given a high-fat liquid meal similar to that given to mice (2 mL per animal). After 45 
min, the animals were anesthetized and catheters were inserted in the main abdominal lymph 
duct. Lymph was collected over 2 hours, and the chylomicrons were isolated. Contaminating 
albumin was removed by incubation for 30 min at room temperature with an equivalent volume 
of swollen Blue Sepharose CL-6B gel (Amersham Pharmacia Biotech) (Mann, C.J., Troussard, 

A. A., Yen, F.T., Hannouche, N., Najib, J., Fruchart, J.-C., Lotteau, V., Andre, P., and Bihain, 

B. E. (1997). J. Biol. Chem. 272, 31348-3 1354). All lipoproteins were stored in the dark at 4 °C 
under N2 and used within 2 weeks (LDL) or 3 days (chylomicrons) of their isolation 

Radiolabelling 

Lipoproteins were radioiodinated using Bilheimer's modification of the McFarlane's 
procedure (Bilheimer, D.W., et al. (1972). Biochim. Biophys. Acta 260, 212-221), and used no 
30 more than 1 week after radiolabeling. ,25 I-LDL was filtered (0.2 >im, Gelman, Ann Arbor, MI) on 
the day of the experiment. 

Leptin was iodinated using Iodobeads (Pierce) according to the manufacturer's 
instructions. 
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Cloning Of Full Length cDNA Human LSR 

Human homologous sequences of rat LSR cDNA were found with 2 partially 
overlapping human genomic sequences (Genbank accession nos: AD000684 and AC002128). 
ESTs generated on the basis of these sequences were used to screen a human BAC library. A 
5 single clone was isolated and sequenced. Analysis of this sequence revealed several variations 
from the public sequence. A revised LSR sequence is currently available in Genbank (accession 
numbers TBA). 

An 805 bp fragment was obtained by PCR amplification of human liver mRNA (Sense 
primer: 5'- CTACAACCCCTACGTCGAGT (SEQ ID NO:22), antisense primer: 5' - 

1 0 AGGCGGAGATCGCCAGTCGT (SEQ ID NO:23)), and subcloned into the TA cloning vector 
(Invitrogen, Carlsbad, CA). The cloned insert was isolated by digestion with EcoRl, was 
purified (GenClean kit, Bio 101, Vista, CA), and the DNA was labeled with a- 33 P-dCTP (NEN, 
Boston, MA) using the random primers labeling system (Life Technologies). The labelled 
fragment was used to screen the cDNA library (Superscript, Life Technologies), from which we 

15 obtained a partial a' clone (clone 18251), lacking 161 bp of the 5' region. 

The missing 5* region was obtained by PCR amplification (AmpliTaq, Promega, 
Madison, WI) from a first strand cDNA prepared from human liver total RNA (Clonetech, Palo 
Alto, CA)(both oligo dT and random primers were used). The primers for PCR were sense 
5 'CCTTTGTCCACGTCGTTTACGCTC-3 ' (SEQ ID NO:24) and antisense 5'- 

20 TCACAGCGTTGCCCTGCTTG -3' (SEQ ID NO:25). The PCR was performed with annealing 
temperature of 65 °C and 35 cycles. The fragment was cloned into pGEMT-Easy Vector 
(Promega). 

Fragments corresponding to the a forms and P were cloned into pGEMT-Easy Vector 
and then used to replace the appropriate region in the LSR a' clone. The full-length LSR a, a', 
25 and p clones were reconstructed in pTracer-CMV2 vector (Invitrogen) using EcoRI/Xba I. 

PCR Analysis of Human LSR 

Similarly to previous results with rat LSR (Yen FT., Masson M., Clossais-Besnard R, Andre P., 
Grosset J.M., Bougueleret L., Dumas J.B., Guerassimenko, O., and Bihain B.E. (1999). J Biol 

30 Chem 274 y 13390-13398), two splice variants of LSR were detected by RT-PCR analysis of 
human hepatocyte cDNA. In Fig. 3B, sense and antisense primers were designed to yield three 
products, of which two were the splice variants. The primer sequences were: sense, 5'- 
TTACTGCTCCGTGGTCTCAGC- 3' (SEQ IDNO:26) and antisense, 5' 
AGCTACTCCTGTCAACGTCTCC - 3' (SEQ ID NO:27). Identities of each band were 

35 confirmed by sequencing. 
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Northern Blotting 

Northern blots were performed as described previously using as a probe clone 18251 described 
above (Yen F.T., Masson M., Clossais-Besnard N., Andre P., Grosset J.M, Bougueleret L., 
Dumas J.B, Guerassimenko, O., and Bihain B.E. (1999). J Biol Chem 274, 13390-13398). 

5 

I n vitro translation 

In vitro translation products were obtained using 35 S-methionine (Amersham) and the T7 
coupled transcription/translation kit from Promega. 

Transient Transfection Studies 

1 0 CHO-K1 cells were plated at a density of 300,000 cells/36 mm dish the day before 

transfection. After 24 h, plasmid preincubated with Fugene transfection reagent was added to 
the cells, which were further incubated at 37 °C. Cells were used 48 h after transfection as 
described in the Brief Descriptioon of the Figures. 

Stable Transfections 

1 5 Stable transfectants were prepared from CHO-K1 cells using Superfect according to the 

manufacturer's instructions. After introduction of the plasmid into the cell with Superfect, the 
cells were grown in the presence of 750 ng/mL zeocin. After elimination of untransfected cells, 
the antibiotic concentration was reduced to 500 ng/mL. Clones were isolated using cloning 
cylinders, and maintained in tissue culture media containing 100 ng/mL zeocin. 

20 FACS Analysis 

Flow cytometry is a laser-based technology that is used to measure characteristics of 
biological particles. The underlying principle of flow cytometry is that light is scattered and 
fluorescence is emitted as light from the excitation source strikes the moving particles. 

Assay 1 :PLC cell suspensions were obtained using non-enzymatic dissociation solution 

25 (Sigma), and then were incubated for 1 h at 4 °C with a 1 :200 dilution of anti-LSR 8 IB or 
irrelevant anti-serum in PBS containing 1% (w/v) BSA. After washing twice with the same 
buffer, goat anti-rabbit FITC-conjugated antibody (Rockland, Gilbertsville, PA) was added to the 
cells, followed by a further incubation for 30 min at 4 °C. After washing, the cells were fixed in 
2% formalin. Flow cytometry analysis was done on a FACSCalibur cytometer (Becton- 

30 Dickinson, Franklin Lakes, NJ). 
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Assay 2 : Cells are cultured in a T175 flasks according to manufacturer's instructions 
for 48 hours prior to analysis. 

Cells are washed once with FACs buffer (lx PBS/2% FBS, filter sterilized), and 
manually scraped from the flask in 10 mLs of FACs buffer. The cell suspension is transferred to 
5 a 15 mL conical tube and centrifuged at 1200 rpm, 4 °C for 5 minutes. Supernatant is discarded 
and cells are resuspended in 10 mL FACs buffer chilled to 4 °C. A cell count is performed and 
the cell density adjusted with FACs buffer to a concentration of 1 xlO 6 cells/ mL. One milliliter 
of cell suspension was added to each well of a 48 well plate for analysis. Cells are centrifuged at 
1200 rpm for 5 minutes at 4 °C. Plates are checked to ensure that cells are pelleted, the 

1 0 supernatant is removed and cells resuspended by running plate over a vortex mixer. One 
milliliter of FACs buffer is added to each well, followed by centrifiigation at 1200 rpm for 5 
minutes at 4 °C. This described cell washing was performed a total of 3 times. 

Primary antibody, titered in screening experiments to determine proper working dilutions 
(for example 1:25, 1:50, 1:100, 1:200, 1:400, 1:500, 1:800, 1:1000, 1:2000, 1:4000, 1:5000, or 

1 5 1 : 10000), is added to cells in a total volume of 50 pL FACs buffer. Plates are incubated for Ih 
at 4 °C protected from light. Following incubation, cells are washed 3 times as directed above. 
Appropriate secondary antibody, titered in screening experiments to determine proper working 
dilutions (for example 1:25, 1:50, 1:100, 1:200, 1:400, 1:500, 1:800, 1:1000, 1:2000, 1:4000, 
1:5000, or 1:10000), is added to cells in a total volume of 50 pL FACs buffer. Plates are 

20 incubated for lh at 4 °C protected from light. Following incubation, cells are washed 3 times as 
directed above. Upon final wash, cells are resuspended in 500 \iL FACs buffer and transfered to 
a FACs acquisition tube. Samples are placed on ice protected from light and analyzde within 1 
hour. 

25 Protein Determinations 

Protein concentrations were determined using Markwell's modified Lowry procedure 
(1981) or BCA protein assay (Pierce Chemical Co, Rockford, IL) and BSA as standard. 

Statistical Analysis 
30 Results were analyzed using unpaired Student's t-test. 

EXAMPLE 1: Effect of Leptin on Postprandial Plasma TG Response 

Transient hypertriglyceridemia seen after administration of a test meal in two strains of 
obese mice with defects of the Ob-Receptor (OB-R) is shown in Figures 1A and IB (open 
35 symbols). The db/db mice present a mutation of the Ob-Rb isoform, preventing signaling to the 
JAK and Stat system, while the db Pas /db Pas lack any leptin signaling capacity through the Ob-R. 
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Similar to what is observed in most obese human subjects (Lewis, G.F., O'Meara, N.M., Soltys, 
P. A., Blackman, J.D., Iverius, P.H., Druetzler, A.F., Getz, G.S., and Polonsky, K.S. (1990) /. 
Clin. Endocrinol. Metab. 71, 1041-105o ; Vansant, G., Mertens, A., and Muls, E. (1999) Intl. J. 
Obesity 23, 14-21) postprandial plasma lipid levels were elevated in both strains of obese mice 
5 when compared to lean controls (shown as dotted lines). A single bolus injection of 50 ng leptin 
at the time of the meal decreased the amplitude of the triglyceride response (Fig. 1 A and IB, 
closed symbols); this effect could not be attributed to a reduction in food intake since the meal 
was administered by intragastric cannulation. 

A significant reduction of the area under the TG curve was observed with 250 ng of 

1 0 leptin per animal (Fig. 1 A, IB, insets). It can be estimated (average body weight of dti^/db Pas , 
74.6 ± 1 1 .4 g; plasma volume 45 mL per kg) that this dose cannot cause more than a two-fold 
increase of the concentration of circulating leptin (86.7 ± 12.2 ng/mL) in db pas /db Pas . Maximum 
effect of leptin was achieved with 500 ng per animal which decreased by > 80% and > 65% the 
area under the postprandial TG curve in db/db and db Pai /db Pa \ respectively. This dose of leptin 

15 (7 |ig per kg body weight) is 1 5-fold lower than that used to achieve 30 to 40% reduction of food 
intake after peripheral administration of leptin (Campfield, L.A., Smith, F.J., Guisez, Y., Devos, 
R., and Burn, P. (1995) Science 269, 546-549; Halaas et al, 1995; Halaas, J.L., Gajiwala, K.S., 
Maffei, M, Cohen, S.L., Chait, B.T., Rabinowitz, D., Lallone, R.L., Burley, S.K., and, J.M. 
(1995) Science 269, 543-546; Pelleymounter, M.A., Cullen, M.J., Baker, M.B., Hecht, R., 

20 Winters, D., Boone, T., and Collins, F. (1995) Science 269, 540-543). These data establish that 
leptin can control the exogenous lipoprotein pathway and that this regulation occurs in 
db^'/db** 5 in spite of the complete defect of the Ob-R. 

EXAMPLE 2: Leptin Binding to Rat LSR 

25 The binding of leptin to LSR was tested using partially purified rat LSR multimeric 

complexes. Complexes separated by SDS electrophoresis (Fig. 2, lane 1) and transferred to 
nitrocellulose, bound l25 I-leptin (Fig. 2, lane 3). The same bands were recognized by polyclonal 
anti-rat LSR antibodies (Fig. 2, lane 2). The specificity of these antibodies has been described 
previously (Yen F.T., Masson M., Clossais-Besnard N., Andre P., Grosset J.M., Bougueleret L., 

30 Dumas J.B., Guerassimenko, O., and Bihain B.E. (1999). J Biol Chem 274, 13390-13398). 

EXAMPLE 3 : Effect of LSR Subunit Transfection 

To determine which of the LSR subunits is responsible for leptin binding, CHO-K1 cells 
were transiently transfected with increasing concentrations of each of the 3 human LSR plasmids 
35 (Fig. 3 A). CHO-K1 cells were selected because they had the lowest level of endogenous LSR 
expression of the different cell lines tested. This level is far lower than that of a human 
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hepatocyte cell line (PLC) used to systematically characterize human LSR activity (Fig. 3B). 
The data showed that only transfection with the LSR a' plasmid increased the binding of leptin 
to CHO-K1 cells (Fig. 3 A). Leptin binding to CHO-K1 cells transfected with LSR p or a 
remained at levels similar to those seen with the vector alone. Analysis of the expression of co- 
5 transfected green fluorescent protein (GFP) estimated transfection efficiency at ± 25% for all 3 
transiently transfected plasmids. 

CHO-K1 cells stably expressing LSR a' were also obtained and were determined to have 
an increased 125 I-leptin binding and uptake (Fig. 3C). The apparent molecular mass of human 
LSR a' in stable CHO-K1 transfectant cells corresponded to that of the smallest LSR subunit (~ 
10 70 kDa) in PLC cells (Fig. 3B). Lineweaver-Burk transformation of leptin binding to CHO-K 1 
cells expressing LSR a' yielded an estimated Kd of 1 .3 nM (Fig. 3C, inset), ~ 2 fold that of the 
Ob-R (Kd = 0.7 nM; Tartaglia et al, 1995). Leptin binding to LSR a' led to its internalization 
and proteolytic degradation consistent with this leptin binding reflecting a biologically relevant 
process (Fig. 3D). 

1 5 Similar to what is observed in cells transfected with the Ob-Ra or Ob-Rb 

(Uotani, S., Bjerbaerk, C, Tornse, J., and Flier, J.S. (1999). Diabetes 48, 279-286.) the amount 
of ,25 I-leptin degraded by CHO-K1 cells transfected with LSR a' represented only 1 6% of that 
bound and internalized by the cells. These rates of ,25 I-leptin degradation are much lower than 
those observed with receptors mediating rapid endocytosis (Goldstein, J.L., Basu, S.K., Brown, 

20 M.S. (1983). 98, 241-260). For instance, after 2 h incubation, the amount of ,25 I-LDL degraded 
through LSR represents 4-5 times the amount bound to the cell surface (Bihain, B.E., and Yen, 
F.T. (1992). Although not intending to be limited by any particular theory, the simplest 
explanation is that LSR a' lacks the di-leucine routing signal known to trigger rapid lysosomal 
delivery. The LSR a contains such a signal, consistent with previous observations that the a 

25 subunit is a critical element allowing LSR to function as lipoprotein receptor (Yen F.T., Masson 
M., Clossais-Besnard N., Andre P., Grosset J.M., Bougueleret L., Dumas J.B., Guerassimenko, 
O., and Bihain B.E. (1999). J Biol Chem 274,13390-13398). 

Similar experiments are performed in the other stable cell lines expressing the subunits 
of LSR alone or in all combinations (see table, below). These cell lines are useful for screening 

30 small molecules or any potential agonist or antagonist for activity against either the leptin or 
triglyceride (or both) activity of LSR. In addition, they can be employed in receptor binding 
assays using FACS analysis or radiolabeled ligands to identify additional ligands of LSR. 
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LSR stable-transfectant Cell Lines 

CHO LSR alpha 

CHO LSR alpha' 
CHO LSR beta 

CHO LSR alphaVbeta — 
CHO LSR alpha/beta ~~ ~~ 
CHO LSR alpha/alpha 7 
CHO LSR alpha/alphaVbeta ™~ 



EXAMPLE 4 : Effect of 8 IB anti-LSR Antibody on LSR Binding and Degradation of Leptin 
To test whether in nontransfected cells leptin binds to LSR, PLC cell lysates were 
5 immunoprecipitated with an antibody directed against a synthetic peptide with a sequence 

identical to LSR residues 35-45 (8 IB). Ligand blotting showed that ,25 Heptin binds directly to 
the multimeric complexes (apparent molecular masses of 200 and 230 kDa) precipitated by the 
8 IB antibody (Fig. 4A). These complexes are of a size similar to that of rat LSR multimeric 
complexes (Yen F.T., Masson M., CIossais-Besnard N., Andre P., Grosset J.M., Bougueleret L., 

10 Dumas J.B., Guerassimenko, O., and Bihain B.E. (1999). J Biol Chem 274, 13390-13398). 

Significant amounts of TCA-soluble chloroform-insoluble !25 I-leptin degradation products were 
found in the incubation media after 2 h incubation of PLC cells with increasing concentrations of 
125 Heptin (Fig. 4B, □). The dose response curve indicated that the process saturated for leptin 
concentrations ^ 10 ng/mL (Fig. 4B). The amount of leptin degraded per mg of PLC cell protein 

1 5 is about half as much as that degraded by CHO-K1 LSR a'stable transfectants (Fig. 3D). 

Chloroquine (50 jaM) inhibited 125 I-leptin degradation by more than 60%, while 
increasing the amount of cell-associated !25 I-leptin (2-4 fold). This is consistent with ,25 I-leptin 
degradation occurring in lysosomes after receptor-mediated endocytosis. The 81B antibody that 
immunoprecipitated LSR multimeric complexes had a profound inhibitory effect on leptin 

20 degradation in PLC cells (Fig. 4B, ■). This effect was maximal with 10 ng/mL of leptin and 200 
jig/mL of antibody and was partially competed off by increasing leptin concentrations at 20 
ng/mL. Because immunoprecipitation data revealed no interaction of the 8 IB antibody with the 
Ob-R or any other protein (Fig. 4 A), the inhibitory effect of this antibody on leptin degradation 
indicates that in cells of liver origin, the LSR is quantitatively the primary mechanism for leptin 

25 degradation. FACS analysis confirmed that the 8 IB anti-LSR antibody binds to non- 
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permeabilized PLC cells (Fig. 4D). This indicates that the amino-terminal is exposed on the cell 
surface. 

Leptin binding to LSR does not require the presence of FFA and is inhibited by the 81B 
antibody directed towards the LSR sequence located near the amino terminal end 
5 Immunoinhibition studies previously showed that the cluster of charged residues found at the 
carboxyl terminal end most likely represents the rat LSR lipoprotein binding site (Yen, FT., 
Masson M., Clossais-Besnard N., Andre P., Grosset J.M., Bougueleret L., Dumas J.B., 
Guerassimenko, O., and Bihain B.E. (1999). J Biol Chem 274, 13390-13398). Accordingly, 
LSR was classified as a type II membrane receptor. FACS analysis using the 170 antibody, 

10 directed towards a synthetic peptide with a sequence corresponding to that of LSR's carboxyl 
terminal end, is consistent with this interpretation (Fig. 4D). 

While not wishing to be limited by any theory, the observation that the 8 IB antibody 
inhibits leptin binding to LSR and binds to intact PLC cells (FACS analysis, Fig. 4D), suggests 
that LSR amino terminal ends are also exposed on the external side of the plasma membrane. 

1 5 LSR contains a typical 28 amino acid transmembrane spanning domain located between residues 
259-286 (Fig. 4C). In addition, a cluster consisting of 3 stretches of hydrophobic amino acids is 
located towards the amino terminal end. Each of these hydrophobic clusters is too short to allow 
crossing of the plasma membrane, but since the three hydrophobic elements are in close 
proximity with only two short hydrophilic separating clusters, a transmembrane spanning region 

20 could be constituted. In this case, the two separating hydrophilic domains would be oriented 
inwardly to minimize interaction with the hydrophobic moieties of the phospholipid bilayers. 
According to this model, LSR a and a' could cross the plasma membrane twice, with both 
carboxyl and amino terminal ends protruding extracellularly. LSR P would be limited to a single 
crossing of the membrane. 

25 

EXAMPLE 5 : Effect of Leptin on LSR Activity 

The effect of leptin on the activity of LSR with respect to its ability to bind, internalize 
and degrade lipoproteins was also studied. Leptin directly increased the oleate-induced LSR 
binding uptake and degradation of l25 I-LDL in a dose-dependent manner (Fig. 5A, 5B, 5C). The 
30 effect was observed at leptin concentrations > 10 ng/mL. 

The specificity of leptin's stimulatory effect upon LSR was further established by the 
observation that leptin at concentrations of up to 2 \xg/mL had no detectable effect on the 
degradation of LDL by the LDL-receptor nor on that of activated a 2 -macroglobulin, the preferred 
LRP ligand. 

35 The stimulatory effect of leptin on LSR activity as a lipoprotein receptor was suppressed 

by the 8 IB antibody (Fig. 5D). The antibody 170 directed against a rat LSR sequence located 
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towards the carboxyl terminal end was used as a control. Although the 170 antibody had an 
inhibitory effect on the oleate-induced 125 I-LDL binding in human PLC incubated without leptin, 
it did not prevent the leptin stimulatory effect on LSR activity (Fig. 5D). 

The stimulatory effect of leptin on LSR activity as lipoprotein receptor was seen not 
5 only in cells of human origin, but also in rodent hepatocytes. A brief, 30 min, preincubation of 
rat hepatocytes with 20 ng/mL mouse recombinant leptin at 37 °C increased oleate-induced !25 I- 
LDL binding to the cell surface in subsequent incubations at 4 °C (Fig. 6A), indicating that this 
stimulatory effect of leptin occurred rapidly. Northern blots showed that this leptin treatment did 
not increase mRNA levels significantly. Further, inhibition of cell protein synthesis (50 \iM 
1 0 cycloheximide) did not suppress the stimulatory effect of leptin, while microfilament inhibitors 
(50 nM cytochalasin B) reduced leptin stimulation by more than 80%. While not wishing to be 
limited by any particular theory, these results are consistent with the stimulatory effect of leptin 
on LSR activity resulting primarily from mobilization of a cryptic pool of receptors to the cell 
surface. 

1 5 Figure 6B shows the additive stimulatory effect of leptin and oleate on the binding of 

chylomicrons to rat hepatocytes. This leptin and oleate-induced binding of chylomicrons to rat 
hepatocytes was suppressed by specific polyclonal anti-LSR antibodies (Fig. 6C). Thus, the 
stimulatory effect of leptin on LSR is not limited to LDL, but extends to TG-rich lipoproteins 
that are directly responsible for the transport of dietary lipid. The data show that physiological 

20 amounts of leptin acutely regulate the removal of dietary TG by the liver, and that in vitro, the 
same concentrations of leptin regulate LSR activity in hepatocytes while leaving that of other 
lipoprotein receptors unchanged. 

The inhibition of the intestinal absorption of dietary lipids by leptin was also 
investigated. Overnight-fasted ob/ob mice were gavage-fed a high fat test meal. Immediately 

25 after the test meal (time = 0 h), the mice were injected intravenously with 200 ^iL saline 
containing either no supplement, 0.5 \ig recombinant mouse leptin, 2.5 mg lactoferrin, or a 
mixture of 0.5 jig leptin and 2.5 mg lactoferrin. Blood samples were taken at 2 and 3 h after the 
test meal, and plasma TG concentrations were measured (see Table, below). Values for these 2 
time points were pooled and are presented as means ± SD of quadruplicate determinations 

30 obtained in 2 different animals for each condition (* p < 0.02 (saline versus leptin; * p < 0.01 
saline versus lactoferrin; * NS (lactoferrin versus leptin + lactoferrin)). 

Table 

Effect of lactoferrin and/or leptin on the plasma lipid response of ob/ob mice 

Plasma TG 2-3 hours after test meal (mg/mL) 



Saline 



1.04 ±0.08 
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Leptin 0.79 ±0.1 * 

Lactoferrin 2.02 ± 0.26 1 

Leptin + Lactoferrin 1 .96 ± 0.42 5 



The amplitude of postprandial lipemia is determined by both the rate of intestinal lipid 
absorption and the rate of lipid clearance. To distinguish between these two possible sites of 
leptin regulation, we used lactoferrin, a milk protein that inhibits the removal of dietary lipid by 
5 the liver (Huettinger ,M., Retzek, H., Eder, M. and Goldenberg, H. (1988). Clin. Biochem. 
27,87-92). As shown in the Table, injection of lactoferrin in ob/ob mice caused a doubling of 
plasma TG measured during the postprandial stage. Further, leptin caused a decrease in 
postprandial plasma TG when injected without lactoferrin, but was unable to achieve a 
significant effect in mice simultaneously treated with lactoferrin. Although not wishing to be 

10 bound by a particular theory, this suggested that most of leptin's regulatory effect was due to 
stimulation of dietary lipid removal by the liver. Lactoferin has been shown previously to be an 
inhibitor of LSR at the concentration used (Yen, F.T., Mann, C.J., Guermani, L.M., Hannouche, 
N.F., Hubert, N., Hornick, C.A., Bordeau, V.N., Agnani, G., and Bihain, B.E. (1994) 
Biochemistry 33, 1 1 72-1 1 80; Mann, C. J., Khallou, J., Chevreuil, O., Troussard, A.A., 

15 Guermani, L.M., Launay K., Delplanque, B., Yen, F.T., and Bihain, B.E. (1995) Biochemistry 
54,10421-10431). 

The effect of leptin injection on the activity of lipolytic enzymes that are involved in the 
hydrolysis of plasma TG was also examined. Injections of leptin (50 ^ig/animal) did not 
significantly modify lipase activity released in serum of db Pa3 /db Pas after heparin injections (Fig. 
20 12). If anything, leptin decreased, although not significantly, the lipase activity when compared 
to the effect of administering the test meal alone. These data ruled out the possibility that leptin 
regulates postprandial lipemia by directly controlling the activity of lipolytic enzymes. 

EXAMPLE 6 : Comparison of the Effect of Human and Mouse Leptin 
25 To establish a link between leptin control of postprandial lipemia in mice and its 

stimulation of LSR in cultured cells, the species specificity in the ability of mouse and human 
leptin to activate LSR in cultured cells was utilized. Mouse leptin was more efficient than 
human leptin in stimulating LSR-mediated LDL degradation in primary cultures of rat 
hepatocytes (Fig. 7A); binding and uptake of l25 I-LDL followed a pattern superimposable to that 
30 of l25 I-LDL degradation. Conversely, human leptin was more efficient in stimulating LSR 
activity in human PLC cells than mouse leptin (Fig. 8B). 

The effect of human (1 ng/animal) and mouse (0.25 ng/animal) leptin on plasma TG 
response of db Pas /db p " mice was also compared. The data showed that human leptin slightly 
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reduced the postprandial plasma TG response (Fig. 7B, closed bar), but the effect did not reach 
statistical significance. This is consistent with the relative inability of human leptin to stimulate 
rodent LSR activity in cultured cells (Fig. 7A, closed bar). Mouse leptin injected at a 4-fold 
lower dose had a pronounced effect on postprandial plasma TG (Fig. 7B, hatched bar), consistent 
5 with its profound stimulatory effect on LSR in cultured cells (Fig. 7A, hatched bar). Thus, the 
effects of human and mouse leptin on postprandial TG response in obese mice paralleled their 
ability to stimulate LSR activity as lipoprotein receptor in cultured cells. Such species 
specificity has not been shown for the Ob-R. 

10 EXAMPLE 7 : Differential Effect of Mouse and Human Leptin and Leptin Peptide in Cells 
Species specificity has been observed with respect to leptin's ability to increase LSR 
activity in rodent or human liver cells (Fig. 8A and 8B). Mouse leptin increases LSR activity 
more in rat hepatocytes, and human leptin increases LSR activity more in human cells. In human 
cells the mouse leptin is inactive and almost approaches an inhibitory effect. 

15 An internal seqment of the leptin polypeptide that is near the carboxy terminus was 

found to differ significantly in different species (See shaded area in Fig. 13). The mouse and 
human sequence of this segment was synthesized as a 22-mer peptide and tested for activity in 
cells (Fig. 9 & 10). The human peptide was agonistic for LSR activity in human cells, while the 
mouse peptide was antagonist for LSR activity in human cells. Thus, the human leptin peptide 

20 has a complete signalling capacity in human cells (Fig. 9). In primary cultures of rat 

hepatocytes, both peptides increased oleate-induced LDL binding, though not to the same exten 
(at concentrations < 50 ixg/mL)t. However, there was an inhibitory effect on oleate-induced LDL 
degradation, indicating that these peptides do not completely mimic the activity of leptin in the 
rat system (Fig. 10). 

25 

EXAMPLE 8 : Effect of Mouse Leptin or Leptin Peptide on the Post-prandial Response 

The apparent Kd of LSR for leptin is in the same range as that of the Ob-receptor, 
suggesting that the regulation of LSR activity by leptin could represent a physiologically relevant 
process. To address this issue, the variation in plasma leptin concentration that occurs after 

30 administration of a test meal to normal mice was measured. Leptin concentrations of 1 .9 ± 0.7 
and 4.5 ± 0.2 ng/mL (p<0.007, n=4) were measured before and 2 h after the meal. However, in 
normal mice, the postprandial increase in plasma TG remained small and transient, even when 
massive amounts of dietary lipid were provided by intragastric cannulation. This reflects the fact 
that in normal mice, the rate of lipid clearance is adapted to that of intestinal absorption. 

35 Imbalance of this system appears to occur only in obese mice. However^ db pal /db Pas mice 

are not a satisfactory model to test the physiological effect of leptin. The plasma leptin levels of 
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these animals are extremely high (86.7 ± 12.2 ng/mL) and furthermore, do not detectably vary 
after administration of a test meal. Two hours after the test meal, leptin concentrations were 
measured as 86.6 ± 18.9 ng/mL (NS, n=5). Therefore, ob/ob mice that lack leptin were used to 
test whether administration of a physiological dose of leptin modulates postprandial lipemia. 
5 As seen in Fig. 1 1 A, a single subcutaneous injection of 50 ng of leptin in ob/ob mice 

decreases the postprandial lipemic response. This injection caused a transient increase in plasma 
leptin concentrations up to 3.25 ± 0.03 ng/mL at 2 h; baseline values were recorded 4 h after 
injection. The dose of leptin that is needed to control postprandial lipemia in ob/ob mice is 5-10 
fold lower than those used in leptin-resistant db/db mice. In ob/ob mice, the signaling effect of 

1 0 leptin could result either from interaction with the Ob-receptor or the LSR. 

A synthetic peptide with a sequence identical to that of mouse leptin between residues 
117-138 was obtained and found to stimulate the oleate-induced binding of 125 I-LDL in primary 
cultures of rat hepatocytes (Fig. 1 1 B, insert). Daily subcutaneous injections of 25 ug of this 
synthetic leptin peptide to ob/ob mice had no effect on the food intake over a 12 day period (7.6 

15 ±0.4 g/day in ob/ob receiving saline and 6.7 ± 0.3 g/day in ob/ob receiving peptide; n = 4, NS). 
Daily injections of 25 ug of mouse leptin caused a reduction of food intake to 4.7 ± 0.5 g/day (n 
= 3; p < 0.003 versus controls). Thus, the synthetic peptide that activates LSR in vitro does not 
influence food intake by activating the Ob-receptor. Injection of 50 ng of this synthetic peptide 
reduced the postprandial lipemic response in ob/ob mice (Fig. 1 IB). 

20 

EXAMPLE 9: Relevance to Disease States 

The instant invention has shown that leptin regulates cellular functions in the absence of 
functional Ob-R. A myriad of peripheral regulatory effects of leptin have been identified and 
attributed to leptin signaling through the Ob-R, even when the targeted tissues lack the long 

25 isoform of the Ob-R, i.e., the sole isoform with a clearly established signaling capacity 

(Friedman, J.M., and Halaas, J.L. (1998). Nature 595,763-770). The characterization of a 
leptin receptor distinct from the Ob-R and controlling the entry of exogenous TG into the liver 
opens the possibility that leptin controls other aspects of cell metabolism independently of the 
Ob-R. Although not wishing to be limited to a particular theory, one hypothesis is that leptin 

30 resistance is due to desensitization of the signaling pathway through which leptin binding to LSR 
leads to mobilization of the receptor to the cell surface. 

Leptin regulation of the exogenous lipoprotein pathway opens new perspectives towards 
the understanding of the relationship between obesity, hypertriglyceridemia and cardiovascular 
disease. Indeed, accumulation in plasma of the residues of chylomicrons has been shown to 

35 increase the risk of cardiovascular disease due to the formation of atherosclerotic plaque (Karpe 
etal, 1998 Atherosclerosis 141,307-314). Hypertriglyceridemia is also considered an 
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independent predictor of cardiovascular disease in obese subjects with Type II diabetes (Feeman, 
1998 Ann. Intern. Med. 128, 73-74). 

By increasing the contribution of the liver to the removal of plasma TG, leptin prevents 
deposition of dietary lipid in adipose tissue in excess of their FFA-releasing capacity. Thus the 
5 liver plays a critical but underestimated role in the pathogeny of obesity. 

EXAMPLE 10: Molecular Modeling of an Active Leptin Fragment of the Invention 

The amino acid sequence for the human leptin fragment with activity is: NH 2 - 
CHLPWASGLETLDSLGGVLEAS-COOH (SEQ ID NO:57; residues 117-138). The amino acid 
1 0 sequence of the mouse leptin fragment with inhibitory activity in the human system is: NH 2 - 
CSLPQTSGLQKPESLDGVLEAS-GOOH (SEQ ID NO:67). 

A molecular dymamic assay (MD) was performed on both the human and the mouse 
22aa peptides. MDs were performed under AMBER force field, in vacuo, with a dielectric 
constant proportional to 4r, a switched cutoff with inner radius of 10A and outer radius of 14 A, a 
1 5 heating phase of 20ps from 0 to 300K by steps of 50K, and a production phase of 1 20ps at 300K. 
At the end of the 120ps MDs, both peptides have lost their short helical part, and have shrunk to 
a more compact conformation. 

The main difference between the human and mouse 22aa peptides in the packed 
conformations is the presence of a residue with higher accessibility (namely L133, before the 2 
20 Glycines of the end sequence LGGVLEAS) in the human 22aa peptide. 

In order to decipher which amino acid is important amoung the 126-129 amino acid 
residues, which differ significantly between human and mouse, the following in-silico 
combinatorial mutational assay was performed. 



25 from the human leptin) was mutated, resulting in 16 mutated peptide models. Each model was 
minimized until reaching an rms gradient of 0.1 Kcal/mol (within the AMBER force field). 
Then, each minimized model was used as the starting conformation of ultra-short molecular 
dynamics (MD) assay (heating phase from OK to 300K of 20ps, and production phase at 300K of 
20ps, in vacuo, under the same conditions as described above). The final MD snapshots were re- 

30 minimized, and the corresponding energies are given in the following HTML table, as well as the 
sequence of the spontaneously formed alpha helices. 



Each residue in positions 126-129 of the 22aa human peptide (conformation extracted 



Energies of 16 Mutated Human 22aa Leptin Peptides 



Central Sequence 



LD 



LE 



PE 



PD 



35 



-87.4 



-79.3 



-83.9 



-69.3 



LDSLGG 



TPDSL 
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(SEQ ID NO:42) 




(SEQ ID NO:46) 


QT 


-66.0 


-83.3 


-68.0 -65.4 




GLQTLDSLG 
(SEQIDNO:47) 


GGVLE 

(SEQ ED NO:48) 


TPDSLG 
(SEQEONO:49) 


EK 


-82.5 


-93.1 


-92.2 -92.2 




SLGGVLEAS 
(SEQIDNO:50) 


PESLGG 
(SEQIDN0:51) 


PDSLGG 
(SEQBDNO:52 


QK 


-83.3 

LGGVLEA 
(SEQ ID NO:53) 


-85.2 


-90.2 -84.2 



1 5 Left column: first 2 aa residues of the mutated ETLD (SEQ YD NO:40) human motif. First line: 
last 2aa residues of the mutated ETLD (SEQ ID NO:40) human motif. Information available in 
each cell: energy of the minimized 20ps snapshot (Kcal/mol), and alpha helix sequence if 
present in the 20ps snapshot. Peptides containing ETLD (SEQ ID NO:40; human motif) and 
QKPE (SEQ ID NO:41 ; mouse motif) are in italic. 

20 Under these conditions, the EKLE (SEQ ID NO:43), EKPE (SEQ ID NO:44) and EKPD 

(SEQ ID NO:45) containing peptides are the most favorable ones and have an alpha helix. 
QKPE (SEQ ID NO:41; mouse motif) and ETLD (SEQ ID NO:40; human motif) containing 
peptides are the next favorable conformations, with an alpha helix for ETLD (SEQ ID NO:40). 
Since the residue composition of each peptide is different, both composition and conformation 

25 energies form part of the comparison, and not only conformation energies. 

Other peptides of the invention that can be tested in the assays described herein or other 
comparable assays for LSR agonistic or antagonistic activity include the following : 

Table 



Human Leptin Peptide Fragments 



Position 


Sequence 


SEQUENCE ID NUMBER 


117-138 


CHLPWASGLETLDSLGGVLEAS 


SEQIDNO:57 


122-143 


ASGLETDSLGGVLEASGYSTE 


SEQIDNO:60 


127-148 


TLDSLGGVLEASGYSTEWALS 


SEQIDNO:62 


132-153 


GGVLEASGYSTEVVALSRGQGS 


SEQEDNO:63 


112-133 


AFSKSCHLPWASGLETLDSLGG 


SEQIDNO:56 


107-128 


LLHVLAFSKSCHLPWASGLETL 


SEQIDNO:55 


102-123 


ENLRDLLHVLAFSKSCHLPWAS 


SEQIDNO:54 


119-136 


LPWASGLETLDSLGGVLE 


SEQIDNO:58 
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121-134 


WASGLETLDSLGGV 


SEO ID N059 


123-132 


SGLETLDSLG 


SEQ ID NO:61 




Table 




Mouse Leptin Peptide Fragments 




Position 


Sequence 


SEQUENCE ID NUMBER 


117-138 


CSLPQTSGLQKPESLDGVLEAS 


SEQE>NO:67 


122-143 


TSGLQKPESLDGVLEASLYSTE 


SEQIDNO:70 


127-148 


KPESLDGVLEASLYSTEVVALS 


SEQIDNO:72 


132-153 


DGVLEASLYSTEWALSRLQGS 


SEQIDNO:73 


112-133 


AFSKSCSLPQTSGLQKPESLDG 


SEQEDNO:66 


107-128 


LLHLLAFSKSCSLPQTSGLQKP 


SEQE>NO:65 


102-123 


ENLRDLLHLLAFSKSCSLPQTS 


SEQIDNO:64 


119-136 


LPQTSGLQKPESLDGVLE 


SEQIDNO:68 


121-134 


QTSGLQKPESLDGV 


SEQIDNO:69 


123-132 


SGLQKPESLD 


SEQIDNO:71 



5 EXAMPLE 1 1: Inhibition of the Expression of Endogenous LSR Using Chimeraplasty 

Chimeraplasty experiments to inhibit the expression of cellular LSR are designed based 
on publications by Cole-Strauss et al. (Science 273 :1386-1389 (1996)) and Alexeev and Yoon 
(Nature Biotech. 16:1343-1346(1998)). The following Example is exemplary only. Other sites 
in LSR can be targeted using the same approach to achieve either inhibition of expression, or to 
1 0 change base pairs to study the importance of various residues (both protein coding and within 
regulatory regions , intronic, or 5' or 3' to the coding region) for LSR functioning in vitro and in 
vivo. Similarly, chimeric oligonucleotides can be designed to modify LSR amino acids either in 
the coding or non-coding regions in experimental animals and for treatment of diseases in 
humans. 

1 5 There are two ATG codons in human LSR. The second ATG corresponds to the ATGs 

present in mouse and rat LSR. The first ATG is used as the start site for at least some of the 
forms at least some of the time, since the N-terminal antibody 8 IB is specific for this region of 
the LSR protein (See other Examples). Therefore, chimeric oligonucleotides were designed for 
the region after the first ATG and before the second ATG, and the region after the second ATG. 

20 The first step was to identify regions of LSR where changing a single base pair results in 

the creation of a stop codon. Although there are three stop codons, TAG (amber), TAA (ochre) 
and TGA (stop), TGA is preferred for giving a complete stop (complete inhibition of LSR 
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expression). Two regions were identified (one after the first ATG and one after the second 
ATG) where changing a single base pair would result in a TGA stop codon, and chimeric 
oligonucleotides were designed for the appropriate sequences (Fig. 9). Chimeric 
oligonucleotides are designed such that they will basically form a double-stranded sequence with 

5 two sets of 4Ts at the bends and a GC-clamp (typically 5 bases in length) at one end and the 
mutated sequence and its wild-type complement forming the main part of the double-stranded 
part (typically 25 bases in length). Flanking the mutated sequence (typically 5 DNA bases) is 2'- 
o-methyl RNA sequence (typically 10 bases on either side). 

Primers and probes were also designed for these regions for use in an allelic 

10 discrimination assay (PE Applied Biosystems, «Allelic Discrimination Using 5' Nuclease 

Assays» www2.perkin-elmer.com/ab/apply/dr/dralb4.html). The use of flourogenic probes in a 
5' nuclease assay combines PCR amplification and allele detection into a single step. 
Hybridization probes for the endogenous and mutant forms of the allele are included in the PCR 
amplification reaction. The hybridization probes are cleaved by the 5' nuclease activity of Taq 

1 5 DNA polymerase only if the probe's target sequence is being amplified. By using a flourogenic 
probe, cleavage of the probe can be detected without post-PCR processing. The flourogenic 
probe comprises an oligonucleotide labeled with both a flourescent reporter dye (typically 5') 
and a quencher dye (typically 3'). In the intact probe, the proximity of the quencher reduces the 
flourescent signal from the reporter dye. Cleavage liberates the reporter dye allowing an 

20 increase in its flourescent activity. The essence of the technique is that it can detect single 

nucleotide mismatches since these interfere with the ability of Taq DNA polymerase to cleave 
the probe. 

Probe placement is dictated by the location of the polymorphism. Generally, the 
polymorphic site should be near the center of the probe, since mismatches at the ends are not 

25 typically as disruptive to hybridization. A separate probe is synthesized for each allele, and each 
is labeled differently (FAM and TET or JOE, for example). The main criterion for probe 
selection is that it be long enough to hybridize at the annealing/extension temperature used in the 
PCR amplification. Calculation of the annealing/extension temperature is routine for those of 
ordinary skill in the art. Typically a probe Tm (melting temperature) of 65-67 C works well at 

30 an annealing temperature of 60-62 C. Therefore, the length of each probe is typically adjusted 
so that both probes have an estimated Tm of 65-67 C. In addition, there can be no G at the 5' 
end, since a G adjacent to the reporter dye quenches flourescence somewhat even after cleavage. 
The probes can be for either strand; the strand with more C's than G's generally performs better 
in the 5'nuclease assay. 

35 Primers are chosen based primarily of estimated Tms as well as small amplicon size. 

Primers with Tms of 58-60 C (approximately 5 C below the probe Tm) generally work well at 
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annealing/extension temperatures of 60-62 C. Generally, primers that are unstable at their 3' 
ends are preferred, as this seems to reduce non-specific priming. Therefore, primers with only 
one to two Gs and Cs within the last 5 nucleotides of the 3' end are preferred. In addition, 
primers should be placed as close as possible to the probe location without overlapping the 
5 probes. This generally results in amplicons of less than 100 bp, which is advantageous for PCR 
amplification success. 

First ATG : 

Chimeric oligonucleotides. DNA is in capital letters; 2*o-methyl RNA is in small letters; 
1 0 mutated base is underlined: 
5'- 

ATGCAACAGGACGGACTTGGAGTAGTTTTcuacuccaagTCAGTccuguugcauGCGCGTT 
TCGCGC - 3' (SEQ ID NO:74) 
Allelic Discrimination Assay: 
1 5 Forward Primer: TGTCCACGTCGTTTACGCTC (SEQ ID NO:75) 

Reverse Primer: TCCCACTTCCGTTCCTTGTC (SEQ ID NO:76) 

Probes (endogenous/mutant): 3' - CCTACTCCAAGTC(C /A)GTCCTGTTGCATT- 

5' 

(SEQEDNO:77) 

20 

Second ATG : 

Chimeric oligonucleotides. DNA is in capital letters; 2' o-methyl RNA is in small letters; 
mutated base is underlined): 

5' - GACCCTGCCCTGTACCTACCTACCAGATGTTTTcaucugguagGTTCAgggcagggucGCG 
25 CGTTTT - 3' (SEQ ID NO:78) 

Allelic Discrimination Assay: 

Forward Primer: GTGGTGATCCTCTTCCAGCCT (SEQ ED NO:79) 

Reverse Primer: CCAGATGACGATGGGTTGC (SEQ ID NO:80) 

Probes (endogenous/mutant): 5' - ACCCTGCCCTG(T/A)CCTACCAGATGAC - 3' 
30 (SEQIDNO:81) 

The chimeric oligonucleotides are also made flourescently labeled to allow tests for transfection 
efficiency. 

Following synthesis of the chimeric oligonucleotides and the primers and probes for the 
35 allelic discrimination assay, the flourescein-labeled chimeric oligonucleotides are transfected 
into PLC cells using standard methodology (other Examples), and the transfection efficiency 
determined by flourescence. The proportion of cells that are flourescent (successful 
transfection) is compared with the total number of cells by techniques that are standard in the art. 
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If the transfection efficiency is low, various parameters of the transfection methodology may be 
modified to increase the transfection efficiency. These parameters are well-known in the art. 

Following a successful transfection of the flourescently-labeled chimeric 
oligonucleotides, the unlabeled chimeric oligonucleotides are transfected into PLC cells, and the 
5 cells are sorted using FACS (flourescent activated cell sorter) after labeling cells with a first anti- 
LSR antibody followed by a flourescently-labeled second antibody that binds the first antibody 
using methods standard in the art. The first antibody can be the N-terminal specific 8 IB 
antibody to sort cells for LSR expression following mutation of the site after the first ATG, but 
needs to be a more C-terminal specific antibody (such as the 170 antibody (to mouse carboxy 

1 0 terminus) or 93 A (to same region of human carboxy terminus)) to sort cells for LSR expression 
tested for creation of the stop codon and expression of LSR expression following mutation of the 
site after the second ATG. 

The cells in both groups with the lower LSR expression are collected to enrich for cells 
with the stop codon in at least one of the copies of LSR. The cells are then cultured and checked 

1 5 for the presence of the stop codon mutations using allelic discrimination. An exemplary reaction 
set-up and procedure is as follows : 



REAGENT 


FINAL CONC. 


(\iL) 


10X TaqMan Buffer A 


IX 


2.5 


25 mM MgCl 2 


5 mM 


5 


dATP 


200 nM 


0.5 


dCTP 


200 nM 


0.5 


dGTP 


200 uM 


0.5 


dUTP 


400 uM 


0.5 


AmpliTaq Gold (5 U/uL) 


1U 


0.2 . 


AmpErase UNG (1 U/nL) 


0.25 U 


0.25 


DEPC H 2 0 




2.55 


TOTAL VOLUME 




12.5 nL 



The primer concentrations can vary from 100 nM to 300 nM. Probe concentrations can vary 
from 50 nM to 200 nM. Template concentrations can vary from 0.1-100 ng/reaction. 
30 STEPS 

1. 50Cfor2min. 

2. 95 C for 10 min. 

3. 95 C for 15 sec. 

4. 58 to 65 C for one min. 
35 5. hold at 4 C 

Repeat steps 3 & 4 for 40 cycles. 
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Following testing, the cells are retransfected with the chimeric oligonucleotides and 
again sorted for LSR expression using FACS. The cells that are expressing the lowest amounts 
of LSR (or none) are selected, cultured to form a homogeneous population, and rechecked using 
allelic discrimination to identify cell clones that no longer express LSR. These cells can then be 
5 used in assays to study the role of the various LSR subunits and the interaction of compounds 
with particular subunits, as well as for screening for modulators of specific LSR activities 
(modulated by the different subunits, for example). In addition, the above-described techniques 
can be used on other cells, (including those in the ATCC databank and in animals or humans) to 
create other kinds of cells lacking LSR activity. As well as the uses as a research and compound 
10 screening tool, the technique is also useful for treatment of diseases related to obesity in vivo. 

Chimeric oligonucleotides were also designed to specifically inhibit either the a subunit 
of LSR, or both the a and the a' subunits of LSR, by targeting either Exon 4 or Exon 5, 
specifically. 
Exon 4 

15 Chimeric oligonucleotides. DNA is in capital letters; 2' o-methyl RNA is in small letters; 
mutated base is underlined): 

5 ' -TGGCTGAGCTCTTACCTGGTTTTCATTTTtgaaaaccagGTCAGagctcagccaGCGCGTT 

TTCGCGC - 3' (SEQ ID NO:82) 

Allelic Discrimination Assay: 

20 Forward Primer: G AGCTC ATCGTCCTTGGG AG (SEQ ID NO: 83) 

Reverse Primer: AGTCTTCTATGGGCCCCGC (SEQ ID NO:84) 

Probes (endogenous/mutant): 3 ' CACCGACTCGAGA(A/C)TGG ACC A AAAGTC 5' 

(SEQIDNO:85) 

25 Exon 5 

Chimeric oligonucleotides. DNA is in capital letters; 2' o-methyl RNA is in small letters; 
mutated base is underlined): 

5' - GGTTGTGGTATGCCTGGCTGCCTTCTTTTgaaggcagccAGTCAtaccacaaccGCGCGT 

TTTCGCGC - 3' (SEQ ID NO:86) 

30 Allelic Discrimination Assay: 

Forward Primer: ACGCAGAGCTCATCGTCCTT (SEQ ID NO:87) 

Reverse Primer: GATGCCCAGGAGGAGGAAGA (SEQ ID NO:88) 

Probes (endogenous/mutant): 3 ' - C A ACACC ATAC(G/T)G ACCG ACGG AA - 5' 

(SEQIDNO:89) 

35 

For both, use FAM as the dye for the endogenous nucleotide (A and G, respectively), and JOE as 
the dye for the changed nucleotide (C and T, respectively). 
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EXAMPLE 12: Use of Zinc Finger Polypeptides for LSR Modulation 

A method for specifically binding DNA of choice and repressing or initiating its 
transcription has been described recently in WO 98/543 1 1 . The repression or initiation can be 
constitutive in the presence of the vector carrying the zinc finger, or it can be placed under the 
5 control of a small molecule switch, for example the TET system, where the expression of the 
repressor/initiator-bound zinc finger can be regulated. This is especially important in systems 
where complete absence of a gene at certain developmental stages, for example, is lethal, or 
where it's overexpression is toxic (Massie B, Couture F, Lamoureux L, Mosser DD, Guilbault C, 
Jolicoeur P, Belanger F, Langelier Y Inducible overexpression of a toxic protein by an 
1 0 adenovirus vector with a tetracycline-regulatable expression cassette. J Virol 1 998 
Mar;72(3):2289-96). 

Zinc finger polypeptides are designed to specifically bind to LSR genomic DNA, and 
then are linked with the KRAB repressor to inhibit LSR expression. Sequences identified for use 
in making the zinc finger polypeptides are : 

15 1936 to 1927 of SEQ ID NO : 1 TAG GGG TGA GCG GCG GGG (SEQ ID NO:91) 

1947 to 1936 of SEQ ID NO : I GAG GGC TGG NNN TAG GGG TGA (SEQ ID NO:92) 
1946 to 1936 of SEQ ID NO : 1 AGG GCT GGG NN TAG GGG TGA (SEQ ID NO:93) 
1 956 to 1 947 of SEQ ID NO : 1 GTG GGA GCC GAG GGC TGG (SEQ ID NO:94) 
1956 to 1946 of SEQ ID NO :1 GTG GGA GCC N AGG GCT GGG (SEQ ID NO:95) 

20 2304 to 2295 of SEQ ID NO : 1 GCG GCG GCC GGG TGG GAG (SEQ ID NO:96) 
1778 to 1787 of SEQ ID NO : 1 TTG GCC GGA GCA GAT GGG (SEQ ED NO:97) 
1 787 to 1 798 of SEQ ID NO : 1 GCA GAT GGG NN CCG GAA GGG (SEQ ID N098): 
1946 to 1934 of SEQ ID NO : 1 AGG GCT GGG NNN AGG GGT GAG (SEQ ID NO:99) 
1934 to 1922 of SEQ ID NO :1 AGG GGT GAG NNN CGG GGA GGG (SEQ ID NO:100) 

25 1740 to 1749 of SEQ ID NO :1 AAG TGG GTC TCG GTT GCA (SEQ ID NO:101) 

The sequences to be bound by zinc finger polypeptides are provided to Sangamo, where 
the actual zinc finger proteins are synthesized and are linked to the KRAB domain, a 
transcription repressor (Pengue G, Calabro V, Bartoli PC, Pagliuca A, Lania L Repression of 
transcriptional activity at a distance by the evolutionarily conserved KRAB domain present in a 

30 subfamily of zinc finger proteins. Nucleic Acids Res 1994 Aug 1 1 ;22(15):2908-14), are 

synthesized. The DNA binding domain can also be linked to transcription initiators (such as 
VP16 ; Proceedings of the National Academy of Sciences USA 94 :5525 (1997)) or a small 
molecule switch system, that is used to turn on or off the zinc finger protein linked to the 
repressor or initiator. Examples of small molecule switches that are effective in cells and in 

35 animals include, the Tet system, RU486, and ecdysone. 
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The zinc finger proteins are delivered as plasmids suitable for transfection into cells 
using standard techniques (Fugene, is a method of choice). The cells used include, but are not 
limited to, the human cell lines HepG2, PLC, Hep3B f C3A, and 293 and the mouse cell lines 
taoBpRcl, BpRcl, and Hepal-6. All cells are available from ATCC. Following transient 
5 transfection, the cells are tested for LSR expression and activity using standard techniques 

described in this application, that may include FACS analysis to look for LSR expression on the 
cell surface, quantitative PCR to look at whether the message is being made, and various 
binding, uptake and degradation experiments to study LSR activity. 

Following a determination of which zinc finger proteins are the most effective in 

10 inhibiting LSR expression, stably tranfected cell lines are created, using the techniques described 
in this application. These cell lines are used to then study the activity of the subunits of LSR 
separately and in combination by co-transfecting them into the cells either stably or transiently, 
or by turning on and off endogenous LSR genes. These cell lines are the basis of assays for 
agonists and antagonists of LSR generally and the subunits separately and in any combination. 

1 5 The zinc finger proteins are also provided as part of a supernatant associated virus, or 

retroviral adenovirus (for example adeno-associated viral (AAV)). These are effective gene 
transfer vectors for use in cells or in animals, as well as humans. Upon receipt, the AAV 
supernatant is amplified using techniques well-known in the art and examples are described in 
Xiao et al. J. Virology 72 :2224-2232 (1998)) and can include the use of helper plasmids as 

20 described in Collaco et al (Gene (1999) 238:397-405). Following amplification, the supernatant 
is used to infect cells or preferably mice using standard techniques in the art some examples of 
which are provided by Snyder et al. (Nature Medicine 5 :64-69 (1999) and Teramoto et al. J. 
Virol. 72 :8904-8912(1998). 

Following infection, the cells are tested as described above ; the mice are tested for 

25 effects on fasting and post-prandial levels of triglycerides, free fatty acids, cholesterol, leptin, 
glucose, insulin, and adipoQ (Acrp30, Apml) as well as fragments thereof, for example, before 
and after feedings as described herein. Similarly to plasmids, contructs in AAV gene transfer 
vectors can be co-infected. Thus, mice or cells can be co-infected with constructs containing 
cDNA encoding the a, a', or P subunits either alone or in combination to study their role in vivo 

30 and to test the effects of agonists/antagonists on specific subunits, or subunit combinations, in 
animals or cells. 



35 
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Sangamo's Universal GeneTools technology platform enables the rational design and 
rapid generation of highly specific ZFP transcription factors that can selectively recognize and 
regulate/modulate transcription of any target gene or DNA sequence. Expression of the ZFP's as 
fusions to activation (herpes simplex virus VP 16) or repression (Kruppel-associated box A 
5 domain / KRAB-A) domains allows transcription to be specifically up or down modulated within 
cells. Figure 25 contains a table with a summary of the five sets of plasmids encoding ZFPs 
targeted to the LSR gene. Each set contains the ZFP target sequence fused to the VP 16 domain 
(NVF), or the KRAB-A domain (NKF). The sequences for the NVF versions of these plasmids 
are listed in Figure 26. These engineered ZFP's are being used for the functional analysis of LSR 
10 in both cell-based assays and in animal models. 

Cell Based Assays: 

To determine the effect of these engineered on LSR expression, mouse hepatocytes were 
transfected and assayed for LSR mRNA by Northern analysis. Hepal-6 cells transfected with 

1 5 ZFP-NVF constructs, were harvested 24 and 48 hours post transfection for total RNA isolation 
(Qiagen RNeasy mini kit). Standard protocols were followed for Northern gels and blotting. 
Blots were probed with the full length mouse LSR alpha cDNA (EcoRI fragment from pTracer 
clone) and G3PDH DNA (Clontech). Probes were prepared using Prime-IT II random primer 
labeling kit (Stratagene) and 32p dCTP. Quantitation of the Northern bands was done using Gel- 

20 Pro software. 

Figure 27 shows an analysis of all 5 candidate ZFPs linked to VP 16. Only 2 of these 
plasmids, 5185 and 5186, exhibited any increase in expression, 6% and 16%, respectively, at 48 
hours post-transfection. Since this increase was not very large, a more detailed analysis of these 
2 ZFPs by Northern and QPCR was used to confirm the up-regulation of LSR by 5185 and 5186. 

25 Hepal-6 cells transfected with ZFP-NVF constructs in triplicate, were harvested 24 and 

48 hours post transfection for total RNA isolation (Qiagen RNeasy mini kit). Standard protocols 
were followed for Northern gels and blotting. Blots were probed with the full length mouse LSR 
alpha cDNA (EcoRI fragment from pTracer clone) and G3PDH DNA (Clontech). Probes were 
prepared using Prime-IT II random primer labeling kit (Stratagene) and 32p dCTP. Quantitation 

30 of the Northern bands was done using Gel-Pro software. The results show an average of 28% 
mRNA increase with 5 1 86 and a 24% increase with 5185 (Fig. 28). It should be noted that there 
was no significant increase in LSR mRNA on either Northern at the 24-hour time point. 

Since the Northern analysis is not quite as sensitive as QPCR, the transcriptional 
increase was confirmed using QPCR. Cells were harvested 48 hours post transfection for Total 

35 RNA isolation (Ambion RNaqueous Kit). RNA was then reverse transcribed to generate cDNA 
for PCR analysis. Primer and Probe sets directed toward the mouse LSR and control GAPDH 
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sequences were used to quantitate levels of transcription in ZFP transfected cells. As shown in 
Figure 29, QPCR results indicate a 41% increase in LSR transcription when Hepa 1-6 cells are 
transfected with ZFP plasmid 5 1 86-NVF and a 30% increase with ZFP plasmid 5 1 85-NVF. 
These results indicate that both 5185 and 5186 plasmids were functioning in cells. 
5 Binding-Uptake-Degradation (BUD) studies were used to assay the ability of these 

plasmids to increase the cells ability to process 125 I-LDL. Cultures of Hepal-6 mouse 
hepatocytes were transfected with ZFP's plasmids 24 hrs after plating. Cells were transfected 
with l|ag plasmid/well in a 6well plate, using Lipofectamine (Gibco BRL) according to 
manufacturer's instructions. Forty-eight hours post transfection, Oleate-induced 125I-LDL 

1 0 binding, uptake, and degradation was measured as described herein. 

Results of the BUD studies indicate increased binding and uptake of labeled LDL when 
Hepal-6 cells are transfected with ZFP's 5186-NVF and 5185-NVF when compared to control 
transfected cells. The data in figure 30 have been corrected either for total protein (30A-30C) or 
for P-gal (30D-30F), which is a crude measure of the transfection efficiency. BUD data supports 

1 5 a role for ZFP 5 1 86-NVF and 5 1 85-NVF in the transcriptional activation of LSR and confirms a 
corresponding increase in functional activity. 

The increase in LDL binding and uptake suggests an increase in expression of LSR at 
the cell surface. To prove this, cells transfected with the ZFPs were analyzed by Flow cytometry 
(FACs) Analysis. FACs analysis (described above) allows for direct estimation of the proportion 

20 of positive cells in a population, as well as an indirect measure of the level of receptor on the cell 
surface (mean fluorescence intensity). 

Hepal-6 cells were transfected with ZFP-NVF constructs 5186 and 5185, along with 
control plasmids. Forty-eight hours post transfection, cells were analyzed for cell surface 
expression of LSR in the presence/absence of Leptin (20 ng/mL). Staining of Hepal-6 cells 

25 involved incubation with primary antibodies, generated in rabbits against mouse LSR NH2 
terminal sequence CPDRASAIQ, or mouse COOH terminal sequence EEGHYPPAPPYSET, 
followed by detection with a fluorescent-labeled secondary antibody against IgG rabbit (Sigma). 

Results indicate that in the presence of Leptin, Hepal-6 transfected with plasmid 51 85- 
NVF had a 50% increase in the level of LSR on the cell surface when compared to controls. 

30 While cells transfected with 5 1 86-NVF had a 35% increase in LSR at the cell surface. These 
findings support a functional role for ZFP 5 1 85-NVF and 5 1 86-NVF in the transcriptional up- 
regulation of LSR and concomitant increase of LSR on the cell surface. 

Analogous experiments are used to assess the efficacy of ZFP-NKFs for repressing LSR 
transcription. 

35 

EXAMPLE 13: Retroviral Library Screening bv FACS 
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In order to identify more genes involved in the regulation of LSR and in ligand signaling 
through LSR (leptin, Clq, AdipoQ (Acrp30, Apml), triglyceride-rich lipoproteins, etc) a 
retroviral library screening assay has been designed. In its most basic form, cells expressing 
LSR (PLC or HepG2, for example) are transfected with a retroviral library. Following sorting 

5 for expression of a marker protein, the cells are treated with a LSR ligand (leptin, for example) 
and assayed for LSR expression by FACS following staining with an antibody to LSR. Cells of 
interest, are those that either express more LSR or less LSR than is expressed following leptin 
stimulation of the same cells without the retroviral library. 

The assay takes advantage of a retroviral vector developed by Lodish at the Whitehead 

10 Institute for Biomedical Research that takes advantage of the spectrum of expression levels of 
cloned cDNAs while simultaneously maintaining the high efficiency of retroviral gene transfer. 
The vectors employ an encephalomyocarditis virus IRES (Jang et a/J. Virol. 62 :2636-2643 
(1988)), followed by a quatitative selection marker, such as green fluorescent protein (GFP) or a 
cell surface marker protein, that are detectable by intrinsic fluorescence or by staining live cells 

15 with a fluorescent antibody, respectively (Fig. 14). Because expression of the two reading 

r frames is strongly correlated , FACS sorting based on the GFP or cell surface marker protein can 
be used to sort the cells for those cells expressing the unknown protein at a desired level-high, 
low, or moderate. For the proposed assay, the cells would preferentially be sorted for moderate 
expression, to allow a detectable, but not overwhelming effect 

20 The individual members of the gene library are placed upstream of the IRES (Fig. 14). 

Genes of interest for screening for their effect on LSR expression on the cell surface include 
cDNA libraries from liver or adipose cells. Cells expressing LSR (such as Hep3B, HepG2, PLC) 
would be transfected by the library using standard techniques so as to achieve approximately 1 
clone (gene) per cell. The cells would then be screened, and those with moderate expression of 

25 GFP would be selected for. Cells where endogenous LSR expression has been knocked out 
either by traditional methods, or using the Sangamo (zinc finger proteins) or chimeraplasty 
techniques described herein could also be used by co-transfecting various subunits of LSR (from 
1-3 and any combination thereof), or in cells stably expressing recombinant LSR subunits, or 
combinations. 

30 In the GLUT 4 system, described by Lodish (Whitehead), the GLUT4 gene was linked to 

7 c-myc epitope tags and then GFP fused in frame at the carboxy terminus. This allows the 
quantity of the gene to be studied in the cell compartment where it is sequestered by comparing 
overall fluorescence with the GFP to cell surface fluorescence with anti-myc antibodies. A 
similar assay is envisioned for LSR where LSR could be fused to GFP (in this case the library 

35 would have to be linked to CD2 or CD4). Alternatively, the amount of LSR sequestered in a 
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cellular compartment could be determined using the 8 IB antibody, for example, and the amount 
of LSR on the cell surface could be determined using the 93 A antibody, for example. 

Once infected cells expressing moderate amounts of GFP are obtained, the cells can be 
treated with leptin, for example, (or any other LSR ligand of interest) and the difference in LSR 
5 levels in the compartment versus the cell surface, or simply on the cell surface can be determined 
by FACS (after antibody staining). Populations that have decreased LSR or increased LSR 
levels could be selected for. Optionally, the cells could be re-selected and then the retroviral 
DNA from the cells PCR'd and sequenced. Samples that appeared to be interesting by 
homologies or locations, for example, could then be cloned and re-transfected for further study. 
1 0 This would alloW the other genes that interact with this system to be discovered. The genes are 
likely to encode proteins whose modulation could have a direct impact on the regulation of 
obesity. 

EXAMPLE 14: Effect of the Leptin Peptide in Mice with Congenital Lipodystrophy 
1 5 Congenital generalized lipodystrophy (CGL) is a rare autosomal recessive disorder 

characterized by a paucity of adipose tissue which is evident at birth and is accompanied by a 
severe resistance to insulin, leading to hyperinsulinemia, hyperglycaemia, and enlarged fatty 
liver (Seip et al Acta Pediatr Supp. 413 :2-28 (1996)). Leptin has been shown to reverse insulin 
resistance and diabetes mellitus in mice with congenital lipodystrophy (Shimomura et a/.Nature 
20 401 :73-76 (1999)). These mice have extremely low levels of leptin in plasma. However, the 
authors do not link the effect of leptin with LSR. The instant invention includes the use of the 
the leptin peptides of the invention for treatment of lipodystrophy and for use in this mouse 
model. 

Leptin peptide will be provided to transgenic mice expressing SREBP-lc436 in adipose 
25 tissue under the control of the adipocyte-specific aP2 promoter/enhancer (Shimomura et al. 

Genes Dev. 12 :3182-3 194 (1998)). The levels used are similar to those described for the ob/ob 
mice herein, a range around 50 ng per mouse. Leptin is provided daily for 12 days, either by 
injection, or using micro-osmotic pumps. Plasma glucose will be measured using a glucose 
(Trinder)-lOO kit, plasma insulin by an anti-rat insulin radioimmunoassay (linco), and plasma 
30 leptin and triglyceride by standard methods described previously. A similar experiment is 
performed where the food intake is restricted to a level that is consumed completely by all 
animals. 



35 
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Example 15 : Effect of Truncated Human LSR on Binding. Uptake & Dejgradation of LDL 

Truncated forms of the LSR receptor were made and tested for their ability to function 
as either dominant positive (i.e. increase the activity of the receptor) or dominant negative 
proteins (i.e. interfere with the activity of the receptor), when over-expressed in cultured cells. 

5 

Materials : 

-Human LSR cDNAs a, a' and p from constructs made in pTracer CMV2. 
-pcDNA/HisMax vector from Invitrogen 

-Appropriate restriction enzymes, T4 DNA polymerase I and Klenow, and T4 DN A ligase. 
1 0 -Standard cloning procedures from "Molecular Cloning" by Sambrook et al. 
-Follow construct plan (Figure 15). 
Method of Cloning & Testing . 

1 . Digest Human LSR plasmids with enzymes of interest under appropriate conditions. 
Separate the appropriate insert fragment from the vector using agarose gel electrophoresis 

15 and Qiaquick gel extraction columns. Note: For constructs 1, 2, 3, 4a, 5, and 6 

pTracerCMV2 LSR a was used as the source for the insert. For construct 4b, pTracerCMV2 
LSR a' was used as the source for the insert. For 4c, pTracerCMV2 LSR P was used as the 
source for the insert. 

2. Digest the pcDNA/HisMax vector in the appropriate reading frame with the enzymes of 
20 interest. Purify using agarose gel electrophoresis and Qiaquick gel extraction columns. 

3. If necessary, treat insert fragments with Klenow DNA polymerase or T4 DNA polymerase I 
to blunt 3' overhangs. Purify DNA from the reaction using Qiaquick PCR purification kit. 

4. Ligate inserts into vector according to Sambrook et al. using a 3-5 M excess of insert to 
vector. 

25 5 . Transform plasmids into competent E. coli - XL 1 blue from Stratagene. Follow 
manufacturer's instructions. 

6. Isolate colonies with correct plasmids by either PCR or Qiagen miniprep analysis. 

7. Verify correct clones by having them sequenced to ensure that they are in the proper reading 
frame and that there are no amino acid changes. 

30 8. Grow and harvest DNA from large-scale cultures using Qiagen endotoxin free maxi preps. 
9. Analyze constructs by transfecting them into human cells and assaying LDL binding using 
the standard BUD protocol. 

BUD Assay Materials : 
35 -DNA from LSR truncated constructs at approximately 1 mg/mL. 

-Lipofectamine Plus transfection reagent -Life Technologies Cat. No 10964-013 
-PLC cells plated at 0.3 x 10 6 cells/well in a 6 well plate. 



10 
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- I25 I-LDL 

-10 mM suramin (70 mL PBS per 1 g suramin) 

-100 mM oleate in isopropanol, freshly prepared from a 400 mM stock solution 

-DMEM (without CaCl 2 ) containing 0.2% (w/v) BSA, 5 mM Hepes, 2 mM CaCl 2 , pH 7.5, and 

3.7 g/L NaHC0 3 (this media should be prepared before the experiment, stored at 4°C, and used 

for up to 1 week) 

-PBS, pH7.4 

- PBS containing 0.2 % (w/v) BSA 

- 0.1 N NaOH containing 0.24 mM EDTA 



BUD Assay Methods: 

1 . Cells (adherent) in 6-well plates seeded at 3 x 10 5 cells 3 days prior to the BUD. 

• Transfect the cells using lipofectamine plus reagent, according to the manufacturer's 
instructions, the day after the cells are seeded. Confluence should be between 50-80% when 

15 transfected. 

• Let cells go about 48 hrs (2 days) after transfection before BUD analysis. 

2. Wash cells once with PBS (room temperature), 2 mL/well 

3. Add DMEM/0.2% BSA (950 uL) 

4. Add oleate, (0 to 1 mM oleate, e.g. 0, 0. 1 mM 0.2 mM, 0.5 mM, 0.8 mM and 1 mL, from 
20 100 mM stock) 

• never exceed 10 uL isopropanol per mL DMEM 

• It is necessary to include wells with no oleate as a control for background. This control 
allows one to calculate the amount of oleate-induced ,25 I-LDL metabolized. 

5. Add appropriate concentration of 125 I-LDL to each well (50 ui of each dilution). 

25 6. Incubate cells for 90 min to 4 hours at 37 °C in a C0 2 incubator. In these experiments, 3 hrs 
was the incubation time. 

7. Transfer media from wells into 5 mL polycarbonate tubes. Store at 4 °C overnight for 
degradation analysis (see below). 

8. Wash cells at 4 °C (on ice): 

30 • Wash 2 times consecutively with ice-cold PBS/0.2% BSA 

• Wash once with ice-cold PBS/0.2% BSA 

• Wash 2 times consecutively with ice-cold PBS 

9. Add 1 mL/ well 10 mM suramin and incubate at 4 °C for 1 hour. 

10. Remove suramin into gamma counter tubes, and count for radioactivity. This represents the 
35 amount of ,25 I-LDL bound to the cell surface. 
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1 1 . Add 0.1 N NaOH/0.24 mM EDTA (ImL/well) and incubate at room temperature for a 
minimum of 30 min. to lyse the cells. 

12. Recover the cell lysates into gamma counter tubes and count for radioactivity. This 
represents the amount of l25 I-LDL internalized . Alternatively, the suramin step may be 

5 omitted (LSR as leptin receptor) and the cells lysed immediately after washing. This would 

represent the amount of cell-associated ,25 I-LDL or l25 I- leptin. 

13. After cell lysates have been counted, determine the protein concentration per mL so that data 
can be reported as ng ,25 I-LDL bound/mg of total protein. Protein is determined using the 
BCA assay from Pierce according to the manufacturer's instructions. Alternatively, data can 

1 0 be corrected for P-Gal units by transfecting extra wells and collecting them for the P-Gal 

assay at the time the BUD is done. For this protocol, see P-Gal protocol, below. 

Degradation of 125 I-LDL 

After leaving overnight at 4 °C, add 1 mL ice-cold 40% TCA to the pre-cooled media. 
Do not Vortex. 
Incubate 1 hour at 4 °C. 

Centrifuge at 3000 rpm (Beckman Allegra centrifuge), 30 min @ 4 °C. (If the precipitate is 
floating, it is necessary to break the air-water interface by gently shaking the tubes before 
pelleting. 

Transfer 1 mL supernatant to 5 mL glass tubes. 
Add 40 \iL 30% H 2 0 2 and vortex briefly. 

Add I mL chloroform and vortex briefly. Let tubes sit for 15 minutes to allow separation of 
the 2 phases. 

Transfer 0.5 mL to gamma counter tubes, and count for radioactivity. 
For the calculation of the amount degraded, the dilution factor is 4.16. Corresponding plates 
without cells serve as controls to define the level of the background. 

B-gal Assay 

1). Transfect cells with test construct + 1/8* the amount of P-gal expressing plasmid. 
30 2). Harvest cells in lysis buffer (250 |aL/well of a 6 well plate). Pull through a syringe several 
times before transferring into an eppendorf tube. 

3) . Freeze cells at -80 °C until ready to perform the assay. 

4) . Thaw cells of interest and spin at 14K in a microfuge at 4 °C for 5 min. 

5) . Transfer 10 jiL of each lysate to a clear PP 96 well plate: 

35 



15 

2. ) 

3. ) 

20 4.) 

5. ) 

6. ) 

7. ) 

25 8.) 
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Example 





1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


A 


Blank 
10 uL 


Blank 
10 nL 


empty 


Sample 
2 10 uL 


Sample 
2 10 uL 


Sample 
2 lOuL 


Etc 












B 


Control 
♦100 uL 


Control 
*100uL 


empty 


Sample 
3 10 uL 


Sample 
3 10 nL 


Sample 
3 10 uL 














C 


Sample 
1 10 nL 


Sample 
1 10 uL 


Sample 
1 10 nL 


Sample 
410nL 


Sample 
4 10 uL 


Sample 
4 10 uL 















* Control = reference standard #Blank = reaction buffer only 

6). Add Fluo-Reporter (3-gal substrate (Molecular Probes cat #F-2905) to P -gal reaction buffer. 



5 (275 CUG substrate [component A] to9.73 mL of reaction buffer) NOTE: need 10 mL for a 
96 well plate, but if you don't use it all it can be stored at -20 °C for at least 6 months. 

7) . Add 100 \xL of Reaction buffer with substrate to each well. 

8) . Incubate at room temp, for 30 min. 

9) . Add 50 jiL of stop mix (0.2 M Na 2 C0 3 ) 

10 10). Read on Cytoflour plate reader with excitation at 360 and emission at 460. Gain should be 
set around 30. 

p-gal reaction buffer : [Final] 



0.5 M NaPhosphate pH 7.3 


40 mL 


0.1M 


1 M MgCl 2 


0.2 mL 


1 mM 


14.3 M B-mercaptoethanol 


629 nL 


45 mM 


ddH 2 0 


159.171 mL 


200 mL 



Lvsis Buffer: 






[Final] 


Buffer II 


9.875 mL 




100 % TritonXlOO 


100 uL 


1% 


400 mM DTT 


25 uL 


1 mM 


Buffer II 




[Final] 


1 M Tris-Ac pH 7.8 


50 mL 


100 mM 


1 M MgAc 


5 mL 


10 mM 


0.5 M EDTA 


1 mL 


1 mM 


ddH 2 0 


439 mL 


500 mL 



Results of BUD Assay : 



WO 01/21647 



98 



PCT/TO00/01470 



Addition of the C-terminal portion of LSR increased I-LDL binding (a), uptake (b) and 
degradation (c) in PLC cells (Fig. 16). I25 I-LDL degradation is increased almost 2 fold at 0.5 
mM oleate. Data in this experiment is corrected for protein only. The transfection efficiency 
was not monitored. All points were done in triplicate. In a separate experiment, addition of the 
5 C-terminal portion of LSR also increased ,25 I-LDL binding (a), uptake (b) and degradation (c) in 
PLC cells (Fig. 17). ,25 I-LDL degradation was increased 2-3 fold at 0.5mM oleate. Data in this 
experiment was corrected for transfection efficiency only. All points are in triplicate. 

The C-terminal portion of LSR from AA353 to 650 (the last AA) as well as the C- 
terminal portion from AA 353 to 541 are able to increase the binding, uptake and degradation of 

10 ,25 I labelled LDL in vitro (Figures 16 & 17). The increase is on the order of 2-3 fold for all 3 
measurements when corrected for transfection effeciency using the P-Gal reporter as a carrier in 
the test DNA. The increase in LDL metabolism is still on the order of 2 fold when data are 
corrected for total protein, depending on the oleate concentration. These constructs can be 
cloned into a vector to allow expression and testing in vivo for this dominant positive effect in 

15 animals using methods well known to those in the art. 

Example 16 : LSR Gene Expression in Liver and Brain of Lean and Obese Mice 

LSR gene expression was determined by quantitative PCR (QPCR) in liver and brain 
tissue of 7 different mouse models: normal and high fat diet-fed C57BL/6J mice (C57), 

20 C57BL6/J ob/ob (ob/ob), C57BLK/S, C57BLK/S db/db (db/db), NZB and NZO mice. The 

normal diet was obtained from Harlan Teklad (Teklad Certified LM-485 mouse/rat 701 1C), the 
high fat diet, also called cafeteria diet was from Research Diets (D12331, Rat Diet 58 kcal % fat 
and sucrose). The cause of obesity in the different models is high fat diet in the obese C57 mice, 
leptin deficiency in ob/ob mice, deficiency in functional leptin receptor in db/db mice. The 

25 cause of obesity in the NZO mouse is currently unknown (Lit 1-3). C57BLK/S and NZB mice 
are both lean and were used as controls since they represent the corresponding background strain 
of db/db and NZO mice, respectively. 

The qPCR results for the different LSR levels in the livers of different mouse strains are 
supported by irnmunohistochemistry result using methods well-known to persons of ordinary 

30 skill in the art. 

Reverse Transcriptase - Polymerase Chain Reaction 

Liver and whole brain were isolated from mice following perfusion with ice-cold saline 
containing 10 mM EDTA. Tissues were stored in RNAlater (Ambion, Austin) at 4 °C for 1 day 
and then at -20 °C. Liver total RNA was isolated using RNAqueous (Ambion, Austin) following 

35 the manufacturer's protocol. The amount of RNA was determined by absorption at 260 nm. The 
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quality of the isolated RNA was verified by the ratio 260/280nm (between 1.9 and 2.1 is good) 
and by denaturing agarose gel electrophoresis. 

RNA was reverse transcribed to cDNA using oligo dT plus an LSR specific primer and 
Superscript II (Gibco BRL) according to manufacturer's instructions. The LSR specific primer 
5 is in exon 6 of the LSR gene (5'ACGCATGGGAATCATGGC; SEQ ID NO:90). Plasmids 
containing mouse LSR-ct/a'/fl sequence were obtained by cloning RT-PCR products produced 
from mouse liver total RNA into pGEM-T easy (Promega). The sequence of the plasmid was 
confirmed by cycle sequencing on a ABI Prism 377 DNA Sequencer. 

Quantitative PCR was performed on a ABI Prism 7700 Sequence Detection System 

1 0 using TaqMan technology (PE Biosystems). TaqMan assay primers and probes were designed 
using Primer Express software (PE Biosystems) and were synthesized by Genset, La Jolla. Each 
probe was double labeled with the fluorescent reporter dye 6-carboxyfluorescein (FAM) 
covalently linked to the 5' end of the probe and the quencher dye 6-carboxy- 
tetramethylrhodamine (TAMRA) attached to the 3' end. Uracil-N-glycosylase technology (PE 

1 5 Biosystems) was used to prevent contamination with PCR product. 

PCR was performed using the following reagent concentrations : 25 mM MgC12, dNTPs 
at 200 \iM 9 except for dUTP at 400 jiM, 1 U of AmpliTaq Gold, 0.25 U AmpErase UNG. 
Primers were added at 300 nM and probes at 200 nM concentration. The forward and reverse 
GAPDH and LSR primers used are shown in Table 1 . PCR reaction conditions were 50 °C for 2 

20 minutes, 95 °C for 10 minutes, followed by 40 cycles at 95 °C for 15 seconds and 1 minute at 60 
°C. PCR was performed in 96 well reaction plates with optical caps and fluorescence was 
continuously followed for each reaction. cDNA corresponding to 15 ng of total RNA were used 
per PCR reaction. 

Quantification of LSR expression was obtained using a standard curve of the 
25 corresponding LSR plasmid covering a concentration range between SxlO* 6 and 5xl0" 10 M 
(approximately 10 6 to 10 2 copies). A standard curve of mouse (C56BL/6J) total liver RNA 
between 200 and 0.1 ng RNA was used to determine relative levels of GAPDH expression. 
Amplification plots were analyzed using SDS software (PE Biosystems). 

30 Table 1 



PCR primers and probes used to determine the expression level of mouse GAPDH and mouse LSR 
isoforms. 



Target 


Forward Primer 


Reverse Primer 


Probe 


Gene 








GAPDH 


AACGAOCCCTTCATTGA 


ciiaxjAiiciuGGOJir 


ACTCACGGCAAATTCAACGGC 




cere 


G 


ACAG 
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LSR 


GGCAGGAGAATCAOCAT 


GATCTTGGGCTGAGAOC 


TGCTXjG(XTGACCTTCGAGCA 


complete 


CACA 


AUJ 




LSR alpha 


GCCCITGGAAGATrGGC 


ATGCTTGGCACAOCTGA 


OCAGTGCTGICCCCACACCTGC 


TCT 


GOT 


T 


LSR alpha' 


AOCAGGGCAGGAGAAT 


GGAGGAAGAAGAGGAG 


AGCTCATRnXTTTCATrGGCT 


CACC 


GCTTG 


CTliUlG 


LSR beta 


TlUIUUliUlTlAlUClU 


CAGGAGAGAGGTGGGT 


AGCAGCCACCTCAGGTGTGOC 




CTGG 


ATAGATGC 


AA 



Quantification by TaqMan technology is based on determining the threshold cycle of 
amplification, which was determined for each unknown sample and for the standard dilutions 
using 0.1 fluorescence units as a threshold (maximum fluorescence > 1.5). The amount of 
unknown cDNA was calculated using the corresponding standard curve. LSR expression was 
given as absolute copy numbers and also normalized for GAPDH expression (by dividing the 
determined absolute copy number by the relative level of GAPDH for each individual animal). 
Each determination was done in triplicate and was repeated at least once; very similar results 
(SD<5%) were obtained. 

All data were confirmed by standard Northern analysis. 16ng total RNA was pooled 
from 4 mice per group and tissue and analyzed by Northern. Although this type of analysis is 
semi-quantitative at best and LSR isoforms can not be differentiated, relative levels of gene 
expression show the same trends as measured by QPCR. 

Results 

LSR Expression in Liver 

Table 1 



LSR gene expression in liver of lean and obese mice (copy numbers in 15ng total liver RNA) 





LSR-alpha 


LSR-alpha' 


LSR-beta 


LSR 


GAPDH LSR total 










(sum of isoforms) 






C57 normal 


ave 93966 


110334 


18454 


222754 


2.8 


281654 




SEM 21760 


16682 


2790 


39779 


0.4 


83220 




ave 42.2% 


49.5% 


8.3% 










SEM 2.5% 


2.4% 


0.3% 








C57 obese 


ave 82814 


44084 


17280 


144177 


6.0 


161206 




SEM 12274 


8073 


2344 


22521 


1.7 


21161 




ave 57.4% 


30.6% 


12.0% 










SEM 1.2% 


1.3% 


0.4% 








C57 ob/ob 


ave 49898 


51056 


21126 


122079 


9.1 


120026 




SEM 5928 


10469 


1758 


15113 


1.0 


32474 




ave 40.9% 


41.8% 


17.3% 










SEM 0.7% 


4.2% 


3.9% 








C57BLK/S 


ave 49029 


68379 


41340 


158749 


3.9 


163060 
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SEM 3862 


3721 


2043 


5903 


0.4 


94537 




ave 30.9% 


43.1% 


26.0% 










SEM 1.3% 


1.6% 


1.8% 








C57BLK/S 


ave 30625 


48504 


18683 


97811 


9.2 


79745 


db/db 
















SEM 1953 


12021 


3123 


10819 


1.0 


26413 




ave 31.3% 


49.6% 


19.1% 










SEM 1.7% 


7.0% 


5.4% 








NZB normal 


ave 98455 


387287 


54079 


539822 


3.1 


588656 




SEM 4446 


13253 


6740 


21241 


0.7 


27993 




ave 18.2% 


71.7% 


10.0% 










SEM 0.6% 


0.8% 


0.9% 








NZO obese 


ave 57497 


225574 


23377 


306448 


1.8 


333271 




SEM 4595 


11767 


1091 


15948 


0.3 


11416 




ave 18.8% 


73.6% 


7.6% 










SEM 0.9% 


1.1% 


0.2% 









LSR Expression in Brain of Lean and Obese mice 

Table 2 

LSR gene expression in brain of lean and obese mice (copy numbers in 1 5ng total liver RNA) 



LSR-alpha LSR-alpha' LSR-beta LSR GAPDH LSR total 

(sum of isoforms) 



C57 normal 


ave 1 192 


6443 


7731 


15365 


36.2 


10653 




SEM 155 


1512 


443 


1717 


3.0 


1933 




ave 7.8% 


41.9% 


50.3% 










SEM 0.5% 


6.0% 


6.3% 








C57 obese 


ave 1496 


10472 


7418 


19387 


20.8 


14118 




SEM 155 


1295 


716 


1998 


5.7 


805 




ave 7.7% 


54.0% 


38.3% 










SEM 0.5% 


1.9% 


2.2% 








C57 ob/ob 


ave 1293 


6502 


6158 


13954 


34.2 


14034 




SEM 190 


797 


475 


863 


5.2 


1939 




ave 9.3% 


46.6% 


44.1% 










SEM 1.0% 


3.4% 


4.4% 








C57BLK/S 


ave 1918 


5585 


6456 


13958 


26.7 


10458 




SEM 206 


354 


1024 


1087 


5.3 


980 




ave 13.7% 


40.0% 


46.3% 










SEM 1.7% 


2.8% 


4.2% 








C57BLK/S 


ave 1834 


5195 


8189 


15217 


35.0 


10912 


db/db 
















SEM 199 


297 


789 


1117 


4.5 


670 




ave 12.0% 


34.1% 


53.8% 










SEM 0.7% 


2.0% 


1.4% 








NZB normal 


ave 654 


1019 


5463 


7135 


17.0 


4430 




SEM 159 


321 


929 


1051 


4.7 


926 




ave 9.2% 


14.3% 


76.6% 










SEM 1.7% 


5.1% 


6.8% 








NZO obese 


ave 168 


320 


2715 


3202 


13.4 


1446 




SEM 112 


52 


37 


1638 


4.5 


1008 




ave 5.2% 


10.0% 


84.8% 
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SEM 12.9% 5.6% 16.8% 



C57BL6/J. C57BLK/S. db/db. ob/ob Mice 

LSR expression in the liver of obese animals is significantly lower than in lean control 
animals (Fig. 1 8). In general, the expression of LSR in brain tissue is much lower than in liver. 
5 However, unlike in liver, obesity does not cause further downregulation (Fig. 21). 

No significant differences in isotype patterns were found in liver samples from the 
different mouse models. LSR alpha and alpha' contribute equally and account for almost all of 
the total LSR expression. LSR beta contributes only a small percentage (Fig. 19 and Fig. 20). 

In contrast, LSR alpha' and beta are the major contributors to overall LSR expression in 
10 brain, accounting in equal proportions for about 90% of total LSR message. No significant 
levels of LSR alpha were seen in any of the studied models (Fig. 22 and Fig. 23). 

The downregulation of LSR seems to be strongly associated with obesity independent of 
the cause of obesity (dietary as well as different genetic defects are the causes in the used 
models). One might expect that upregulation of liver LSR expression in obese individuals would 
15 be beneficial. 

NZB and NZO Mice 

LSR expression in liver tissue of NZB mice is 2-fold higher than in normal C57 mice. 
Obesity (in the NZO) again leads to strong downregulation, however, this level is still 
20 significantly higher than in other obese mice (Fig. 1 8). In contrast, LSR expression in the brain 
of NZB and even more so in brain tissue of NZO, is significantly lower than in the other 5 
models (Fig. 21). 

Distribution of LSR isotypes in NZB and NZO mice was very different from the 
previously described 5 models. The dramatic increase in liver LSR expression seen in NZB (and 

25 in NZO) mice was found to be mainly LSR alpha'. This form accounted for 80% of total LSR 
(Fig. 19 and Fig. 20). The complete opposite was seen in brain tissue. NZB mice have very low 
expression of LSR alpha and alpha' with LSR beta being the dominant isoform. This picture is 
even more pronounced in NZO mice. Brain LSR in these animals is almost exclusively LSR beta 
and some animals had virtually no alpha or alpha' expression (Fig. 22 and Fig. 23). 

30 The fact that NZO mice respond to intracerebroventricular injection of leptin but not to 

peripheral injection (Halaas JL, et al., Proc.Natl.Acad.Sci. USA, 94, 8878-8883, 1997) suggests 
a transport defect. Since LSR alpha' has been shown to bind leptin, and since LSR alpha' levels 
are reduced in NZO mice, the implication is that the genetic defect in NZO mice causing obesity 
might be deficiency in brain LSR alpha' expression resulting in non-functioning leptin transport 

35 across the blood brain barrier. This conclusion is further supported by the discovery that some 
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NZO mice that do not become obese have LSR alpha' expressed at significant levels in brain. 

Example 17 : Effect of a Ser -> Asn substitution on LSR activity in human hepatocvtes 

Previously, we described a frequent (allele frequency 12%) G ^ A mutation of cDNA 
5 base pair 1088 (LSR exon 6), which results in a Ser *Asn mutation at amino acid position 363, 

presumably in the extra-cellular domain of the receptor. 

In a group of 34 obese adolescent girls, this coding mutation significantly increased 

fasting and postprandial plasma triglyceride response to a high fat test meal. In a larger 

population of 154 obese adolescent girls, the same coding mutation significantly and selectively 
1 0 influenced fasting plasma triglyceride levels and increased 3.5 fold the risk of 

hypertriglyceridemia. This data suggested that LSR plays a significant role in the clearance of 

triglyceride-rich lipoproteins. Interestingly, even individuals heterozygous at this locus showed 

the effect. 

An in vitro model was obtained after sequence analysis of LSR in 2 cell lines, PLC and 
1 5 HepG2, revealed that PLC cells are homozygous for the G allele, while HepG2 cells are 
heterozygous, having both the G and A.allele. 

Methods: 

The oleate-induced ,25 I-LDL binding, uptake and degradation was measured in HepG2 
20 and PLC according to the method described previously (Bihain, B.E., and Yen, F.T. 

(1992). Free fatty acids activate a high-affinity saturable pathway for degradation of low-density 
lipoproteins in fibroblasts from a subject homozygous for familial hypercholesterolemia. 
Biochemistry 31, 4628-4636.). Briefly, confluent monolayers of cells were washed once in 
phosphate buffered saline (PBS), and then incubated 3 h at 37 °C with increasing concentrations 
25 of oleate (as indicated) and 20 ng/mL 125 I-LDL. At the end of the incubation, cells were placed 
on ice and washed twice with PBS containing 0.2% BSA, once with the same buffer, and then 
twice with PBS alone. The amounts of l2S I-LDL bound, internalized and degraded were then 
measured according to the method of Bihain, B.E., and Yen, F.T. (1992). Free fatty acids 
activate a high-affinity saturable pathway for degradation of low-density lipoproteins 
30 in fibroblasts from a subject homozygous for familial hypercholesterolemia. Biochemistry J7, 
4628-4636. 

Results: 

The PLC cell line displayed a much greater capacity to bind, internalize and degrade ,25 I- 
LDL in the presence of increasing concentrations of oleate, as compared to the HepG2 cell line 
35 (Fig. 24). This is most marked in the degradation. The decrease in degradation observed with > 
0.5 mM oleate concentrations is thought to be due to the accumulation of oleate as triglycerides 
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in the cells. This increase in lipid in the cells decreases proteolytic degradation at the lysosomal 
level. 

Quantitative PCR and facs data indicates that LSR expression is almost 50% higher in 
HepG2 cells than in PLC cells. This would be consistent with the notion of compensation for the 
5 lower activity of the receptor in the cells. 

These in vitro data suggest that a person with a G/G genotype (hence Ser) would display 
a greater ability to clear triglycerides during the postprandial stage as compared to one with a 
G/A genotype. Since we have postulated a rate-limiting role of LSR in the removal of dietary 
lipid, these data could explain the significant association found between low postprandial 
1 0 triglyceride levels and G/G genotype. In contrast to G/G homozygotes, G/A heterozygotes with 
lower LSR activity would have a lesser capacity of removing dietary lipid, thus increasing their 
time in the circulation. This would in turn cause a change in the partitioning of lipid between the 
liver and the adipose tissue, leading to a greater deposition of fat in the adipose tissue. 

This example indicates the potential use of this polymorphism, as a marker to detect 
1 5 people with a propensity towards obesity. It also supports the hypothesis that LSR is a potential 
pharmaceutical target for the development of compounds aimed at targeting lipids away from the 
adipose tissue and towards the liver. 

Example 1 8: Leptin Transport through the Blood Brain Barrier 

20 Human leptin transport through the blood-brain barrier (BBB) is studied using an in vitro 

model (Dehouck, et al J Neurochem 54:1798-801, 1990). This model closely mimics the in vivo 
situation with regard to the selective passage of nutrients and drugs through the cerebrovascular 
endothelium. The presence of tight junctions that prevent non-specific diffusion, the expression 
of specific receptors such as LDL receptor and transferrin receptor, and the expression of P- 

25 glycoprotein in brain capillary endothelial cells in vitro demonstrates that this model is a useful 
system to study the selective transport through the BBB. Briefly, this model consists of a co- 
culture of bovine brain capillary endothelial cells (ECs) and rat astrocytes (Figure 31). The 
astrocytes are seeded on the plastic of a six-well dish and grown for 3 weeks. A collagen-coated 
filter is then set in each dish and bovine ECs are plated on the upper-side of the filter. ECs form 

30 a confluent monolayer in 5 days and they are used for experiments after 16 days of coculture 
with astrocytes. 



Methods 

Leptin transcytosis: Experiments were performed on brain capillary endothelial cells in 
35 coculture with astrocytes for 16 days. On the day of the experiment, ECs were transferred to a 
clean 6-well plate containing 2 mL of Ringer-Hepes buffer (see, Fig. 32). At time 0, 1 mL 
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Ringer Hepes containing 125 l-leptin was placed in the upper compartment. After 30, 60, 120, or 
180 min incubation at 37 °C on a rocking platform, the insert was transferred into another well of 
a six-well plate to minimize the possible passage of substances from the lower to the upper 
compartment. At the end of the experiment, the amount of radioactivity of each well was 
5 counted. The transcytosis was performed over 3 h with 1) 10 ng/mL ,25 I-leptin (10,000 dpm/ng), 
2) lOng/mL ,25 I-leptin + 1 Mg/mL of cold leptin, 3) 10 ng/mL ,25 I-leptin + 50 ng/mL peptides or 
4) 10 ng/mL 125 I-leptin + 2 mg/mL lactoferrin. The synthetic peptides studied include the human 
(HP) and mouse (MP) leptin peptide fragments : CHLPWASGLETLDSLGGVLEAS (SEQ ID 
NO:57) and CSLPQTSGLQKPESLDGVLEAS (SEQ ID NO:67), respectively. 
10 Sucrose and inulin permeability studies: The [ 14 C]-sucrose (342Da) and [ 3 H]-inulin 

(57000 Da) are hydrosoluble molecules which pass through the BBB through non-receptor 
mediated processes. The transport is nonspecific and primarily through tight junctions. These 
serve as markers for the integrity of the BBB and hence toxicity of the added compounds on the 
cerebral endothelium. 

15 After 16 days of coculture, permeability studies were performed as described in Figure 

32. On the day of the experiment, ECs were transferred to a new 6-well plate containing 2 mL of 
Ringer-Hepes. At time 0, 1 mL Ringer Hepes containing [ I4 C]-sucrose, [ 3 H]-inulin and cold 
leptin were placed in the upper compartment. Sucrose and inulin permeability studies were 
performed in the presence of 10 ng/mL leptin, 5 ng/mL leptin, 10 jig/mL leptin or without leptin 

20 as a control. The effect of peptides was also tested by the addition of 10 ng/mL Ieptin+50 ^ig/mL 
mouse peptide (MP), 10 ng/mL leptin+50 jig/mL human peptide (HP), or 10 ng/mL leptin+2 
mg/mL lactoferrin (lacto). At the end of the experiment, an aliquot from each well was placed in 
a scintillation vial, and radioactivity was determined. 

The transport of molecules through the endothelial monolayer was determined for each 

25 time point as % passage: % passage of radiolabelled molecule through the endothelium: dpm found 
in the lower compartment at a time point divided by the initial dpm found in the upper 
compartment: % transport at 30min = (lower dpm t30 / upper dpm)* 100 

Results 

30 Figure 33 shows an increased transport of radiolabelled leptin over time through the 

endothelium monolayer after 16 days of coculture. The addition of unlabelled leptin reduced 
the amount of leptin by approximately 30%, indicating that there is a specific component 
involved in the transport of leptin across the EC monolayer. A higher concentration of 
unlabelled leptin is needed to decrease the effect of nonspecific processes. The specific 

35 component involved in leptin transport is associated with the complete differentiation and 
formation of the BBB. 
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Lactoferrin, an inhibitor of LSR, significantly inhibited the amount of leptin transported. 
The mouse leptin peptide fragment had no significant effect on leptin transport. However, the 
addition of human leptin peptide fragment caused a significant increase in the amount of leptin 
transcytosis. This same peptide fragment increases LSR activity in human hepatocytes. 
5 The integrity of the BBB was tested using sucrose and inulin (Figure 34 A and 34B). It is 

clear that the integrity of the BBB was not significantly compromised by the addition of leptin, 
the peptides, or lactoferrin. Hence, we can conclude that the trancytosis measured in Figure 33 
represents active processes, and is not due to disintegration of the EC monolayer. 

Thus the invention is drawn to inhibitors and activators of LSR as a means for 
1 0 controlling the transport of leptin across the blood brain brain. Agents directed towards 
activation or inhibition of brain LSR regulate leptin transport into the CNS where it acts as 
satiety factor. 

While preferred embodiments of the invention has been illustrated and described, it will 
be appreciated that various changes can be made by one skilled in the art without departing from 
1 5 the spirit and scope of the invention. 

Example 19: Effect of Longterm Exposure to High Levels of Leptin on LSR Activity 

Human liver cells preincubated with 200 ng/mL human recombinant leptin for 24 h had a 
markedly reduced LSR activity (Fig. 35A, □), as compared to those not preincubated with leptin 

20 (Fig. 35 A, ■). Leptin retained its ability to acutely increase oleate-induced 125 I-LDL binding to 
LSR in a subsequent short incubation (Fig. 35A, □). However, the maximal stimulatory effect 
was reduced by about 50%, and was achieved only with higher leptin concentrations (100 
ng/mL). In hepatocytes preincubated for 24 h with high doses of leptin (200-400 ng/mL), a 25- 
35% decrease of hepatocyte LSR mRNA relative to GAPDH was observed, as compared to 

25 control cells (Fig. 35B). 

Although not wishing to be limited by any particular theory, these data suggest that the 
consistently elevated leptin levels in db/db mice cause a decrease in LSR expression, as well as 
cause a reduction in leptin's ability to acutely stimulate the receptor. This, and the fact that 
plasma leptin did not increase in db Pas /db Pas after the test meal could explain the massively- 

30 elevated postprandial lipemic response observed in this strain. However, because leptin 
signaling to LSR proceeds independent of the Ob-R, acute increase in plasma leptin 
concentrations obtained with injection of 500-50,000 ng of leptin in db/db mice could accelerate 
the removal of lipid by activating LSR. 

Based on these observations, it is likely that 1) the reduced LSR activity, caused by the 

35 constantly high levels of circulating leptin, and 2) the lack of increase in plasma leptin levels 
during the postprandial stage, contribute to elevated postprandial plasma TG levels in db/db. It 
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should be noted that the dose of leptin regulating postprandial lipemia in ob/ob is ~ 500 fold 
lower than those typically used to reduce food intake (2). In db/db mice, leptin doses 10 fold 
greater than those used in ob/ob mice were needed to achieve maximal regulation of postprandial 
lipemia. Thus, the regulation of postprandial lipemia in db/db mice appears partially leptin- 
5 resistant, despite the fact that leptin signaling effect occurs independently of the Ob-R. 
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What is claimed is : 

CLAIMS 

1 . A leptin polypeptide fragment that modulates an activity of LSR, comprising at least 4, 
but not more than 50 contiguous amino acids of a polypeptide sequence selected from 
the group consisting of SEQ DD NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID 
NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:37, SEQ ID NO:38 and SEQ ID NO:39, wherein said at least 4 and 
not more than 50 contiguous amino acids include the leptin fragment central sequence. 



2. The fragment of claim 1, comprising at least 10, but not more than 50 contiguous amino 
acids of said polypeptide sequence, wherein said at least 10 and not more than 50 
contiguous amino acids include said leptin fragment central sequence. 

3. The fragment of claim 2, comprising at least 20, but not more than 40 contiguous amino 
acids of said polypeptide sequence, wherein said at least 20 and not more than 40 
contiguous amino acids include said leptin fragment central sequence. 

4. The fragment of claim 3, comprising at least 20, but not more than 30 contiguous amino 
acids of said polypeptide sequence, wherein said at least 20 and not more than 30 
contiguous amino acids include said leptin fragment central sequence. 

5. The fragment of claim 1, comprising a 22 contiguous amino acid sequence, that is at 
least 75% identical to said leptin fragment variable region of said polypeptide sequence. 

6. The fragment of claim 5, wherein said 22 contiguous amino acid sequence is at least 
85% identical to said leptin fragment variable region. 

7. The fragment of claim 6, wherein said 22 contiguous amino acid sequence is at least 
95% identical to said leptin fragment variable region of said polypeptide sequence. 



8. The fragment of any one of claims 1 to 7, wherein said polypeptide sequence is SEQ ED 
NO:32. 



9. 



The fragment of any one of claims 1 to 7, wherein said polypeptide sequence is SEQ ID 
NO:34. 
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10. A polynucleotide encoding said leptin fragment of any one of claims 1 to 9, or the 
complement of a polynucleotide encoding said leptin fragment of any one of claims 1 to 
9. 

5 

11. A recombinant cell comprising said polynucleotide of claim 10. 

12. A recombinant vector comprising said polynucleotide of claim 11. 

10 13. A recombinant cell comprising said vector of claim 12. 

14. A pharmaceutical composition comprising said leptin fragment of any one of claims 1 to 
9 and a pharmaceutically acceptable diluent. 

15 15. A method of preventing or treating an obesity-related disease or disorder comprising 

providing to an individual in need of such treatment said pharmaceutical composition of 
claim 14. 

16. The method of claim 15, wherein said obesity-related disease or disorder is 
20 selected from the group consisting of obesity, anorexia, cachexia, cardiac 

insufficiency, coronary insufficiency, stroke, hypertension, atheromatous 
disease, atherosclerosis, high blood pressure, non-insulin-dependent diabetes, 
hyperlipidemia, hyperuricemia, congenital generalized lipodystrophy, and 
Syndrome X. 



25 



17. The method of claim 16, wherein said individual is a mammal. 



18. A method of designing mimetics of a leptin fragment that modulates an activity of LSR, 
comprising: 

30 (a) identifying critical interactions between one or more amino acids of said leptin 

fragment of any one of claims 1 to 9 and LSR; 

(b) designing potential mimetics to comprise said critical interactions; and 

(c) testing said potential mimetics ability to modulate said activity as a means for 
designing said mimetics. 

35 
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19. The method of claim 18, wherein said activity is selected from the group consisting of 
leptin binding, leptin uptake, leptin degradation, triglyceride binding, triglyceride 
uptake, and triglyceride degradation. 

20. The method of claim 18, wherein said critical interactions are selected from the group 
consisting of hydrogen bonding, covalent bonding, Van der Waals forces, steric 
hindrances, and hydrophobic interactions. 

21. The method of claim 18, wherein said critical interactions are identified using assays 
selected from the group consisting of NMR, X-ray crystallography, and computer 
modeling. 

22. A chimeric oligonucleotide, comprising at least 9 contiguous nucleotides from a 
sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, 
and SEQ ID NO: 16, wherein said at least 9 contiguous nucleotides comprise at least one 
amino acid codon selected from the group consisting of TTA, TTG, TCA, TCG, TAU, 
TAC, TGT, TGC, TGG, CAA, CAG, AGA, GAA, GAG, and GGA, and wherein a point 
mutation is present in said codon such that said codon is a stop codon. 

23. A method of inhibiting the expression of at least one subunit of LSR, comprising 
providing to a cell said oligonucleotide of claim 22. 

24. The method of claim 23, wherein said cell is selected from the group consisting of PLC, 
CHO-K1, HepG2, Hepa 1-6, and Hep3B. 

25. A chimeric oligonucleotide, comprising at least 9 contiguous nucleotides of SEQ ED 
NO:l, wherein said at least 9 contiguous nucleotides comprise a single nucleotide 
polymorphism selected from the group consisting of Al to A32. 

26. A zinc finger protein, comprising a DNA binding domain that binds specifically to 18 
nucleotides of a sequence at least 50% homologous to SEQ ID NO:l, wherein said 18 
nucleotides comprise two fragments of 9 contiguous nucleotides, and wherein said 
fragments are separated by 0, 1, 2, or 3 nucleotides. 
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27. The protein of claim 26, wherein said sequence is at least 50% homologous to sequences 
selected from the group consisting of 1-2356, 2357 to 3539, 3885 to 12162, 12283 to 
15143, 15201 to 17764, 15912 to 19578, 19753 to 19898, 19959 to 20055, 20188 to 
20328, and 20958 to 21046 of SEQ ID NO:l. 

5 

28. The protein of claim 27, wherein said sequence is at least 50% homologous to residues 
2357 to 3539 of SEQ ID NO: 1. 

29. The protein of any one of claims 26 to 28, wherein said protein further comprises a 
10 functional domain selected from the group consisting of a transcription repressor and a 

transcription initiator. 

30. The protein of claim 29, wherein said repressor is a KRAB repressor. 

15 31. The protein of claim 29, wherein said initiator is a VP 1 6 initiator. 

32. The protein of any one of claims 26 to 31, wherein said protein further comprises a 
small molecule regulatory system. 

20 33. The protein of claim 32, wherein said system is selected from the group consisting of a 

Tet system, RU486, and ecdysone. 

34. A polynucleotide encoding said protein of any one of claims 26 to 33. 

25 35. A method of modulating the expression of at least one subunit of LSR, comprising 

providing to a cell said polynucleotide of claim 34. 

36. The method of claim 35, wherein said cell is selected from the group consisting of PLC, 
CHO-K1, HepG2, Hepa 1-6, and Hep3B. 



30 



37, The method of claim 35, wherein said cell is in a mammal. 

38. The method of claim 37, wherein said mammal is a mouse. 



35 39. 



A recombinant vector comprising said polynucleotide of claim 34. 
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40. The vector of claim 39, wherein said vector is an adenovirus associated virus. 

41. A method of treating or preventing an obesity-related disease or disorder comprising 
providing to an individual in need of such treatment a treatment selected from the group 

5 consisting of said oligonucleotide of either of claims 22 or 25, said polynucleotide of 

claim 34, and said vector of either one of claims 39 or 40. 

42. The method of claim 41, wherein said individual is a mammal. 

10 43. The method of either one of claims 41 and 42, wherein said providing comprises a 

liposome. 

44. A non-human mammal comprising said polynucleotide of claim 34 or said vector of 
either one of claims 39 or 40. 

15 

45. A recombinant cell comprising said polynucleotide of claim 34 or said vector of either 
one of claims 39 or 40. 

46. The cell of claim 45, wherein said cell is selected from the group consisting of PLC, 
20 CHO-K1, Hep3B, Hepa 1-6 and HepG2. 

47. The recombinant cell of either one of claims 45 or 46, wherein said recombinant 
cell is transfected with at least one LSR polypeptide comprising a sequence at 
least 75% identical to an amino acid sequence selected from the group consisting 

25 of SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 1 1, 

SEQ ID NO: 1 3, SEQ ID NO: 1 7, SEQ ID NO: 1 8, and SEQ ID NO: 1 9. 

48. The recombinant cell of claim 47, wherein said transfected cell is stably transfected. 

30 49. A method for selecting a compound useful for the treatment or prevention of an obesity- 

related disease or disorder, comprising: 

(a) contacting said recombinant cell of any one of claims 45 to 48 with a 
candidate compound; and 

(b) detecting a result selected from the group consisting of a modulation of an 
35 activity of the Lipolysis Stimulated Receptor and modulation of expression 

of the Lipolysis Stimulated Receptor; as a method for 
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(c) selecting said compound useful for the treatment or prevention of said 
obesity-related disease or disorder. 

50. The method of claim 49, wherein said contacting is in the presence of a ligand of 
5 said Lipolysis Stimulated Receptor. 

5 1 . The method of claim 50, wherein said ligand is selected from the group 
consisting of cytokine, lipoprotein, free fatty acids, Apml, and Clq. 

1 0 52. The method of claim 5 1 , wherein said cytokine is leptin. 

53. The method of claim 52, wherein said leptin is said leptin polypeptide fragment of any 
one of claims 1 to 9. 

1 5 54. The method of any one of claims 49 to 53, wherein said activity is selected from 

the group consisting of binding of lipoproteins, uptake of lipoproteins, 
degradation of lipoproteins, binding of leptin, uptake of leptin, and degradation 
of leptin. 

20 55. The method of any one of claims 49 to 54, wherein said modulation is an 

increase in said activity. 

56. The method of any one of claims 49 to 54, wherein said modulation is a decrease 
in said activity. 

25 

57. The method of any one of claims 49 to 56, wherein said expression is on the 
surface of said cell. 

58. The method of any one of claims 49 to 57, wherein said detecting comprises 
30 FACS. 

59. The method of any one of claims 49 to 58, wherein said detecting further 
comprises antibodies that bind specifically to said LSR, wherein said LSR 
comprises an amino acid sequence at least 75% homologous to at least one of the 

35 sequences selected from the group consisting of SEQ ED NO:3, SEQ ID NO:5, 
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SEQ ID NO:7, SEQ ID N0:9, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: ! 7, 
SEQ ID NO:18, and SEQ ID NO:19. 

60. The method of claim 59, wherein said antibodies bind specifically to a region of 
5 said LSR selected from the group consisting of an amino terminus, a carboxy 

terminus, a splice site, a cytokine binding site, a fatty acid binding site, a clathrin 
binding site, an apoprotein ligand binding site, a LI/LL motif, a RSRS motif, and 
a hydrophobic region. 

10 61 . The method of any one of claims 49 to 60, wherein said cell is selected from the group 

consisting of PLC, CHO-K1, Hep3B, and HepG2. 

62. The method of any one of claims 49 to 61, wherein said candidate compound is 
selected from the group consisting of peptides, peptide libraries, non-peptide 

1 5 libraries, peptoids, fatty acids, lipoproteins, medicaments, antibodies, and small 

molecules. 

63. The method of any one of claims 49 to 62, wherein said obesity-related diseases 
and disorders are selected from the group consisting of obesity, anorexia, 

20 cachexia, cardiac insufficiency, coronary insufficiency, stroke, hypertension, 

atheromatous disease, atherosclerosis, high blood pressure, non-insulin- 
dependent diabetes, hyperlipidemia, hyperuricemia, , congenital generalized 
lipodystrophy and Syndrome X. 

25 64. A method of selecting for genes that modulate an activity of the Lipolysis Stimulated 

Receptor, comprising : 

(a) providing a retroviral gene library to cells that express said Lipolysis 
Stimulated Receptor; 

(b) contacting said cells with a ligand of said Lipolysis Stimulated Receptor ; 

30 (c) detecting a change in said activity of the Lipolysis Stimulated Receptor as a 

means for selecting for said genes. 

65. The method of claim 64, wherein said retroviral gene library comprises a cDNA library 
from tissues selected from the group consisting of liver, muscle, brain, and adipose. 

35 
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66. The method of either one of claims 64 or 65, wherein said retroviral gene library further 
comprises a detectable marker protein selected from the group consisting of GFP, 
truncated CD2, and truncated CD4. 

67. The method of claim 66, further comprising selecting said cells of (a) for moderate 
expression of GFP. 

68. The method of claim 67, wherein said selecting of cells is by FACS. 

69. The method of any one of claims 64 to 68, wherein said ligand is selected from 
the group consisting of cytokine, lipoprotein, free fatty acids, Apml, and Clq. 

70. The method of claim 69, wherein said cytokine is leptin. 

71. The method of claim 70, wherein said leptin is said leptin polypeptide fragment of any 
one of claims 1 to 9. 

72. The method of any one of claims 64 to 71, wherein said detecting a change in said 
activity is by FACS. 

73. The method of claim 72, wherein said detecting further comprises fluorescent 
antibodies that bind specifically to said LSR, wherein said LSR comprises an 
amino acid sequence at least 75% homologous to at least one of the sequences 
selected from the group consisting of SEQ ID NO:3, SEQ ID NO:5, SEQ ID 
NO:7, SEQ ID NO:9, SEQ ED NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 17, SEQ ID 
NO:18, and SEQIDNO:19. 

74. The method of claim 73, wherein said antibodies bind specifically to a region of 
said LSR selected from the group consisting of an amino terminus, a carboxy 
terminus, a splice site, a cytokine binding site, a fatty acid binding site, a clathrin 
binding site, an apoprotein ligand binding site, a LI/LL motif, a RSRS motif, and 
a hydrophobic region. 

75. The method of any one of claims 64 to 74, wherein said cell is selected from the group 
consisting of PLC, CHO-K1, Hep3B, Hepa 1-6, and HepG2. 
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Effect of test meal with and without leptin injection on postheparin lipolytic 

activity in db Pas /db Pas mice 





1 Postheparin lipolytic 
! activities in db Pas /db Pas 
\ (\imo\ FFA/ml/h) 


No high-fat test meal 


: 11.7 ±2.4 


Hign-iat test meal 


, 19.5 ± 9.2 ns 


High-fat test meal + 50 ng 
leptin 


; 12.2 ± 2.7 ns 


ns = not significant). 




Figure 12 
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Figure 14 
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Figure 17A 



Degradation of LDL in PLC cells transienctly 
transfected with D/N contructs 5 & 6 
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Figure 17B 



Degradation of LDL in PLC cells transienctly 
transfected with D/N contructs 5 & 6 
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- - transfected with D/N contructs 5 & 6 
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LOCUS pSBS5182-N 6319 bp DNA CIRCULAR SYN 

DEFINITION Ligation of 5182 into NVF (Kpnl, BamHI) 
ACCESSION pSBS5182-N 
REFERENCE 1 (bases 1 to 6319) 
FEATURES Location/Qualifiers 
CDS 956.. 1003 

/gene="NLS fl 

/product = "Nuclear Localization Signal" 
CDS 1004.. 1597 

/gene="ZFP" 

/product="LSR 2B-1A" 
CDS 1598.. 1840 

/gene="VP16" 

/product ="VP16 activation domain" 
CDS 1841.. 1867 

/ gene = "FLAG" 

/product = "FLAG epitope" 
CDS 3064.. 3947 

/gene="Neo" 

/product* "neomycin resistance" 
CDS complement (5321.. 6181) 

/gene="Amp " 

/product= "Ampcillin resistance" 
BASE COUNT 1451 a 1683 C 1651 g 1534 t 

ORIGIN 

1 GACGGATCGG 
61 CCGCATAGTT 

121 CGAGCAAAAT 

181 TTAGGGTTAG 

241 GATTATTGAC 

301 TGGAGTTCCG 

361 CCCGCCCATT 

421 ATTGACGTCA 

4 81 ATCATATGCC 

541 ATGCCCAGTA 

601 TCGCTATTAC 

661 ACTCACGGGG 

721 AAAATCAACG 

781 GTAGGCGTGT 

841 CTGCTTACTG 

901 GTTTAAACTT 

961 CCCCAAGAAG 
1021 CTGCCACATC 
1081 GCGCTGGCAC 
1141 CACCCAGTCC 
1201 TTGTCCGGAA 
1261 CCACCAGAAC 
1321 GAAGAAGCAG 
1381 CCTGACCCGC 
1441 CTGTGGTAAA 



GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 
AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 
TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 
GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 
TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 
ATGGGTGGAC TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 
AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 
CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 
ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 
GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG 
ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA 
GCTTATCGAA ATTAATACGA CTCACTATAG GG AG AC CC AA GCTGGCTAGC 
AAGCTGATCC ACTAGTCCAG TGTGGTGGAA TTCGCTAGCG CCACCATGGC 
AAGAGGAAGG TGGGAATCCA TGGGGTACCG GGCAAGAAGA AGCAGCACAT 
CAGGGCTGTG GTAAAGTTTA CGGCGACCGC TCCAACCTGA CCCGCCACCT 
ACCGGCGAGA GGCCTTTCAT GTGTACATGG TCCTACTGTG GTAAACGCTT 
GGCGACCTGA CCCGCCACAA GCGTACCCAC ACCGGTGAGA AGAAATTTGC 
TGTCCGAAGC GCTTCATGAT GTCCCACCAC CTGTCCCGCC ACATCAAGAC 
AAGAAGGGTG GATCTGGTGA TGGTGGCCGT CGCGGTGCCG GTTCTGGCAA 
CACATCTGCC ACATCCAGGG CTGTGGTAAA GTTTACGGCG AGCGCGGCGA 
CACCTGCGCT GGCACACCGG CGAGAGGCCT TTCATGTGTA CATGGTCCTA 
CGCTTCACCG ACCCGGGCGC CCTGGTGCGC CACAAGCGTA CCCACACCGG 



Figure 26F 
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1501 TGAGAAGAAA TTTGCTTGTC CGGAATGTCC 
1561 CCAGCACATC AAGACCCACC AGAACAAGAA 
1621 CCTGGGGGAC GAGCTCCACT TAGACGGCGA 
1681 AGACGATTTC GATCTGGACA TGTTGGGGGA 
1741 CCACGACTCC GCCCCCTACG GCGCTCTGGA 
1801 TACCGATGCC CTTGGAATTG ACGAGTACGG 
1861 CAAGTAAGCT TCTCGAGTCT AGAGGGCCCG 
1921 CTTCTAGTTG CCAGCCATCT GTTGTTTGCC 
1981 GTGCCACTCC CACTGTCCTT TCCTAATAAA 
2041 GGTGTCATTC TATTCTGGGG GGTGGGGTGG 
2101 ACAATAGCAG GCATGCTGGG GATGCGGTGG 
2161 GCTGGGGCTC TAGGGGGTAT CCCCACGCGC 
2221 TGGTGGTTAC GCGCAGCGTG ACCGCTACAC 
22 81 CTTTCTTCCC TTCCTTTCTC GCCACGTTCG 
2341 GCATCCCTTT AGGGTTCCGA TTTAGTGCTT 
24 01 AGGGTGATGG TTCACGTAGT GGGCCATCGC 
24 61 TGGAGTCCAC GTTCTTTAAT AGTGGACTCT 
2521 TCTCGGTCTA TTCTTTTGAT TTATAAGGGA 
2581 ATGAGCTGAT TTAACAAAAA TTTAACGCGA 
2641 GTGTGGAAAG TCCCCAGGCT CCCCAGGCAG 
2701 AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG 
2761 TGCATCTCAA TTAGTCAGCA ACCATAGTCC 

2 821 CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC 
2881 AGGCCGAGGC CGCCTCTGCC TCTGAGCTAT 
2941 GCCTAGGCTT TTGCAAAAAG CTCCCGGGAG 
3001 AGACAGGATG AGGATCGTTT CGCATGATTG 
3061 CCGCTTGGGT GGAGAGGCTA TTCGGCTATG 
3121 ATGCCGCCGT GTTCCGGCTG TCAGCGCAGG 
3181 TGTCCGGTGC CCTGAATGAA CTGCAGGACG 
3241 CGGGCGTTCC TTGCGCAGCT GTGCTCGACG 
3301 TATTGGGCGA AGTGCCGGGG CAGGATCTCC 

3 361 TATCCATCAT GGCTGATGCA ATGCGGCGGC 
3421 TCGACCACCA AGCGAAACAT CGCATCGAGC 
34 81 TCGATCAGGA TGATCTGGAC GAAGAGCATC 

3 541 GGCTCAAGGC GCGCATGCCC GACGGCGAGG 
3601 TGCCGAATAT CATGGTGGAA AATGGCCGCT 
3661 GTGTGGCGGA CCGCTATCAG GACATAGCGT 
3721 GCGGCGAATG GGCTGACCGC TTCCTCGTGC 
3781 GCATCGCCTT CTATCGCCTT CTTGACGAGT 
3841 GACCGACCAA GCGACGCCCA ACCTGCCATC 
3901 TGAAAGGTTG GGCTTCGGAA TCGTTTTCCG 
3961 GGATCTCATG CTGGAGTTCT TCGCCCACCC 

4 021 CAAATAAAGC AATAGCATCA CAAATTTCAC 
4081 TTGTGGTTTG TCCAAACTCA TCAATGTATC 
4141 CTAGAGCTTG GCGTAATCAT GGTCATAGCT 

42 01 AATTCCACAC AACATACGAG CCGGAAGCAT 
4261 GAGCTAACTC ACATTAATTG CGTTGCGCTC 
4321 GTGCCAGCTG CATTAATGAA TCGGCCAACG 

43 81 CTCTTCCGCT TCCTCGCTCA CTGACTCGCT 



GAAGCGCTTC ATGCGCTCCG ACAACCTGAC 
GGGTGGATCC GCCCCCCCGA CCGATGTCAG 
GGACGTGGCG ATGGCGCATG CCGACGCGCT 
CGGGGATTCC CCGGGGCCGG GATTTACCCC 
TATGGCCGGC TTCGAGTTTG AGCAGATGTT 
TGGGGGCAGC GACTACAAGG ACGACGATGA 
TTTAAACCCG CTGATCAGCC TCGACTGTGC 
CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 
ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 
GGCAGGACAG CAAGGGGGAG GATTGGGAAG 
GCTCTATGGC TTCTGAGGCG GAAAGAACCA 
CCTGTAGCGG CGCATTAAGC GCGGCGGGTG 
TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG 
CCGGCTTTCC CCGTCAAGCT CTAAATCGGG 
TACGGCACCT CGACCCCAAA AAACTTGATT 
CCTGATAGAC GGTTTTTCGC CCTTTGACGT 
TGTTCCAAAC TGGAACAACA CTCAACCCTA 
TTTTGGGGAT TTCGGCCTAT TGGTTAAAAA 
ATTAATTCTG TGGAATGTGT GTCAGTTAGG 
GCAGAAGTAT GCAAAGCATG CATCTCAATT 
GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA 
CGCCCCTAAC TCCGCCCATC CCGCCCCTAA 
ATGGCTGACT AATTTTTTTT ATTTATGCAG 
TCCAGAAGTA GTGAGGAGGC TTTTTTGGAG 
CTTGTATATC CATTTTCGGA TCTGATCAAG 
AACAAGATGG ATTGCACGCA GGTTCTCCGG 
ACTGGGCACA ACAGACAATC GGCTGCTCTG 
GGCGCCCGGT TCTTTTTGTC AAGACCGACC 
AGGCAGCGCG GCTATCGTGG CTGGCCACGA 
TTGTCACTGA AGCGGGAAGG GACTGGCTGC 
TGTCATCTCA CCTTGCTCCT GCCGAGAAAG 
TGCATACGCT TGATCCGGCT ACCTGCCCAT 
GAG C ACGT AC TCGGATGGAA GCCGGTCTTG 
AGGGGCTCGC GCCAGCCGAA CTGTTCGCCA 
ATCTCGTCGT GACCCATGGC GATGCCTGCT 
TTTCTGGATT CATCGACTGT GGCCGGCTGG 
TGGCTACCCG TGATATTGCT GAAGAGCTTG 
TTTACGGTAT CGCCGCTCCC GATTCGCAGC 
TCTTCTGAGC GGGACTCTGG GGTTCGAAAT 
ACGAGATTTC GATTCCACCG CCGCCTTCTA 
GGACGCCGGC TGGATGATCC TCCAGCGCGG 
CAACTTGTTT ATTGCAGCTT ATAATGGTTA 
AAATAAAGCA TTTTTTTCAC TGCATTCTAG 
TTATCATGTC TGTATACCGT CGACCTCTAG 
GTTTCCTGTG TGAAATTGTT ATCCGCTCAC 
AAAGTGTAAA GCCTGGGGTG CCTAATGAGT 
ACTGCCCGCT TTCCAGTCGG GAAACCTGTC 
CGCGGGGAGA GGCGGTTTGC GTATTGGGCG 
GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT 
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4441 ATCAGCTCAC TCAAAGGCGG TAATACGGT' 
4501 GAACATGTGA GCAAAAGGCC AGCAAAAGGC 
4561 GTTTTTCCAT AGGCTCCGCC CCCCTGACGA 
4621 GTGGCGAAAC CCGACAGGAC TATAAAGATA 
4681 GCGCTCTCCT GTTCCGACCC TGCCGCTTAC 
4741 AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 
4801 CTCCAAGCTG GGCTGTGTGC ACGAACCCCC 
4861 TAACTATCGT CTTGAGTCCA ACCCGGTAAG 
4921 TGGTAACAGG ATTAGCAGAG CGAGGTATGT 
4981 GCCTAACTAC GGCTACACTA GAAGGACAGT 
5041 TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 
5101 TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 
5161 TTTGATCTTT TCTACGGGGT CTGACGCTCA 
5221 GGTCATGAGA TTATCAAAAA GGATCTTCAC 
5281 TAAATCAATC TAAAGTATAT ATGAGTAAAC 
5341 TGAGGCACCT ATCTCAGCGA TCTGTCTATT 
5401 CGTGTAGATA ACTACGATAC GGGAGGGCTT 
54 61 GCGAGACCCA CGCTCACCGG CTCCAGATTT 
5521 CGAGCGCAGA AGTGGTCCTG CAACTTTATC 
5581 GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA 
5641 AGGCATCGTG GTGTCACGCT CGTCGTTTGG 
5701 ATCAAGGCGA GTTACATGAT CCCCCATGTT 
5761 TCCGATCGTT GTCAGAAGTA AGTTGGCCGC 
5821 GCATAATTCT CTTACTGTCA TGCCATCCGT 
5881 AACCAAGTCA TTCTGAGAAT AGTGTATGCG 
5941 ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 
60 01 TTCGGGGCGA AAACTCTCAA GGATCTTACC 
6061 TCGTGCACCC AACTGATCTT CAGCATCTTT 
6121 AACAGGAAGG CAAAATGCCG CAAAAAAGGG 
6181 CATACTCTTC CTTTTTCAAT ATTATTGAAG 
6241 ATACATATTT GAATGTATTT AGAAAAATAA 
6301 AAAAGTGCCA CCTGACGTC 



: ATCCACAGAA TCAGGGGATA ACGCAGGAAA 
CAGGAACCGT AAAAAGGCCG CGTTGCTGGC 
GCATCACAAA AATCGACGCT CAAGTCAGAG 
CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT 
CGGATACCTG TCCGCCTTTC TCCCTTCGGG 
TAGGTATCTC AGTTCGGTGT AGGTCGTTCG 
CGTTCAGCCC GACCGCTGCG CCTTATCCGG 
ACACGACTTA TCGCCACTGG CAGCAGCCAC 
AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG 
ATTTGGTATC TGCGCTCTGC TGAAGCCAGT 
ATCCGGCAAA CAAACCACCG CTGGTAGCGG 
GCGCAGAAAA AAAGGATCTC AAGAAGATCC 
GTGGAACGAA AACTCACGTT AAGGGATTTT 
CTAGATCCTT TTAAATTAAA AATGAAGTTT 
TTGGTCTGAC AGTTACCAAT GCTTAATCAG 
TCGTTCATCC ATAGTTGCCT GACTCCCCGT 
ACCATCTGGC CCCAGTGCTG CAATGATACC 
ATCAGCAATA AACCAGCCAG CCGGAAGGGC 
CGCCTCCATC CAGTCTATTA ATTGTTGCCG 
TAGTTTGCGC AACGTTGTTG CCATTGCTAC 
TATGGCTTCA TTCAGCTCCG GTTCCCAACG 
GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC 
AGTGTTATCA CTCATGGTTA TGGCAGCACT 
AAGATGCTTT TCTGTGACTG GTGAGTACTC 
GCGACCGAGT TGCTCTTGCC CGGCGTCAAT 
TTTAAAAGTG CTCATCATTG GAAAACGTTC 
GCTGTTGAGA TCCAGTTCGA TGTAACCCAC 
TACTTTCACC AGCGTTTCTG GGTGAGCAAA 
AATAAGGGCG ACACGGAAAT GTTGAATACT 
CATTTATCAG GGTTATTGTC TCATGAGCGG 
ACAAATAGGG GTTCCGCGCA CATTTCCCCG 
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LOCUS 

DEFINITION 
ACCESSION 
REFERENCE 
FEATURES 
CDS 



CDS 



CDS 



CDS 



CDS 



CDS 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 



pSBS5183-N 6319 bp DNA CIRCULAR SYN 

Ligation of 5183 into NVF (Kpnl, BamHI) 

pSBS5183-N 

1 (bases 1 to 6319) 

Location/Qualifiers 

956. .1003 

/gene="NLS " 

/product= "Nuclear Localization Signal' 1 

1004 . .1597 

/gene="ZFP" 

/product ="LSR 4A-3A" 

1598. .1840 

/gene="VP16" 

/product =s"VP16 activation domain" 

1841. .1867 

/gene="FLAG" 

/product =" FLAG epitope" 

3064 . .3947 

/gene="Neo" 

/product =" neomycin resistance" 
complement (5321 . . 6181) 
/gene = "Amp " 

/product ="Ampc ill in resistance" 
1446 a 1683 c 1655 g 1535 t 



GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 
CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 
CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 
TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 
GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 
ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 
ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 
ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 
TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 
661 ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 
721 AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG 
781 GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA 
841 CTGCTTACTG GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTAGC 
901 GTTTAAACTT AAGCTGATCC ACTAGTCCAG TGTGGTGGAA TTCGCTAGCG CCACCATGGC 
961 CCCCAAGAAG AAGAGGAAGG TGGGAATCCA TGGGGTACCG GGCAAGAAGA AGCAGCACAT 
1021 CTGCCACATC CAGGGCTGTG GTAAAGTTTA CGGCCAGTCC GGCCACCTGG CCCGCCACCT 
1081 GCGCTGGCAC ACCGGCGAGA GGCCTTTCAT GTGTACATGG TCCTACTGTG GTAAACGCTT 
1141 CACCACCTCC GGCGAGCTGG TGCGCCACAA GCGTACCCAC ACCGGTGAGA AGAAATTTGC 
1201 TTGTCCGGAA TGTCCGAAGC GCTTCATGCG TTCCGACCAC CTGTCCCGTC ACATCAAGAC 
1261 CCACCAGAAC AAGAAGGGTG GATCTGGTGA TGGTGGCCGT CGCGGTGGCG GTTCTGGCAA 
1321 GAAGAAGCAG CACATCTGCC ACATCCAGGG CTGTGGTAAA GTTTACGGCG AGCGCGGCGA 
1381 CCTGACCCGC CACCTGCGCT GGCACACCGG CGAGAGGCCT TTCATGTGTA CATGGTCCTA 
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1441 CTGTGCTAAA CGCTTCACCC AGCGCGCCCA 
1501 TGAGAAGAAA TTTGCTTGTC CGGAATGTCC 
1561 CCGCCACATC AAGACCCACC AGAACAAGAA 
1621 CCTGGGGGAC GAGCTCCACT TAGACGGCGA 
1681 AGACGATTTC GATCTGGACA TGTTGGGGGA 
1741 CCACGACTCC GCCCCCTACG GCGCTCTGGA 
1801 TACCGATGCC CTTGGAATTG ACGAGTACGG 
1861 CAAGTAAGCT TCTCGAGTCT AGAGGGCCCG 
1921 CTTCTAGTTG CCAGCCATCT GTTGTTTGCC 
1981 GTGCCACTCC CACTGTCCTT TCCTAATAAA 
2041 GGTGTCATTC TATTCTGGGG GGTGGGGTGG 
2101 ACAATAGCAG GCATGCTGGG GATGCGGTGG 
2161 GCTGGGGCTC TAGGGGGTAT CCCCACGCGC 
2221 TGGTGGTTAC GCGCAGCGTG ACCGCTACAC 
22 81 CTTTCTTCCC TTCCTTTCTC GCCACGTTCG 
2341 GCATCCCTTT AGGGTTCCGA TTTAGTGCTT 
2401 AGGGTGATGG TTCACGTAGT GGGCCATCGC 
2461 TGGAGTCCAC GTTCTTTAAT AGTGGACTCT 
2521 TCTCGGTCTA TTCTTTTGAT TTATAAGGGA 
2581 ATGAGCTGAT TTAACAAAAA TTTAACGCGA 
2641 GTGTGGAAAG TCCCCAGGCT CCCCAGGCAG 
2701 AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG 
2761 TGCATCTCAA TTAGTCAGCA ACCATAGTCC 
2821 CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC 
2881 AGGCCGAGGC CGCCTCTGCC TCTGAGCTAT 
2941 GCCTAGGCTT TTGCAAAAAG CTCCCGGGAG 
3001 AGACAGGATG AGGATCGTTT CGCATGATTG 
3061 CCGCTTGGGT GGAGAGGCTA TTCGGCTATG 
3121 ATGCCGCCGT GTTCCGGCTG TCAGCGCAGG 
3181 TGTCCGGTGC CCTGAATGAA CTGCAGGACG 
3241 CGGGCGTTCC TTGCGCAGCT GTGCTCGACG 
3301 TATTGGGCGA AGTGCCGGGG CAGGATCTCC 
3361 TATCCATCAT GGCTGATGCA ATGCGGCGGC 
3421 TCGACCACCA AGCGAAACAT CGCATCGAGC 
34 81 TCGATCAGGA TGATCTGGAC GAAGAGCATC 
3541 GGCTCAAGGC GCGCATGCCC GACGGCGAGG 
3601 TGCCGAATAT CATGGTGGAA AATGGCCGCT 
3661 GTGTGGCGGA CCGCTATCAG GACATAGCGT 
3721 GCGGCGAATG GGCTGACCGC TTCCTCGTGC 
3781 GCATCGCCTT CTATCGCCTT CTTGACGAGT 
3841 GACCGACCAA GCGACGCCCA ACCTGCCATC 
3901 TGAAAGGTTG GGCTTCGGAA TCGTTTTCCG 
3961 GGATCTCATG CTGGAGTTCT TCGCCCACCC 
4021 CAAATAAAGC AATAGCATCA CAAATTTCAC 
4081 TTGTGGTTTG TCCAAACTCA TCAATGTATC 
4141 CTAGAGCTTG GCGTAATCAT GGTCATAGCT 
4201 AATTCCACAC AACATACGAG CCGGAAGCAT 
4261 GAGCTAACTC ACATTAATTG CGTTGCGCTC 
4321 GTGCCAGCTG CATTAATGAA TCGGCCAACG 
43 81 CTCTTCCGCT TCCTCGCTCA CTGACTCGCT 
4441 ATCAGCTCAC TCAAAGGCGG TAATACGGTT 



CCTGGAGCGC CACAAGCGTA CCCACACCGG 
GAAGCGCTTC ATGCGCTCCG ACGCCCTGAC 
GGGTGGATCC GCCCCCCCGA CCGATGTCAG 
GGACGTGGCG ATGGCGCATG CCGACGCGCT 
CGGGGATTCC CCGGGGCCGG GATTTACCCC 
TATGGCCGGC TTCGAGTTTG AGCAGATGTT 
TGGGGGCAGC GACTACAAGG ACGACGATGA 
TTTAAACCCG CTGATCAGCC TCGACTGTGC 
CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 
ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 
GGCAGGACAG CAAGGGGGAG GATTGGGAAG 
GCTCTATGGC TTCTGAGGCG GAAAGAACCA 
CCTGTAGCGG CGCATTAAGC GCGGCGGGTG 
TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG 
CCGGCTTTCC CCGTCAAGCT CTAAATCGGG 
TACGGCACCT CGACCCCAAA AAACTTGATT 
CCTGATAGAC GGTTTTTCGC CCTTTGACGT 
TGTTCCAAAC TGGAACAACA CTCAACCCTA 
TTTTGGGGAT TTCGGCCTAT TGGTTAAAAA 
ATTAATTCTG TGGAATGTGT GTCAGTTAGG 
GCAGAAGTAT GCAAAGCATG CATCTCAATT 
GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA 
CGCCCCTAAC TCCGCCCATC CCGCCCCTAA 
ATGGCTGACT AATTTTTTTT ATTTATGCAG 
TCCAGAAGTA GTGAGGAGGC TTTTTTGGAG 
CTTGTATATC CATTTTCGGA TCTGATCAAG 
AACAAGATGG ATTGCACGCA GGTTCTCCGG 
ACTGGGCACA ACAGACAATC GGCTGCTCTG 
GGCGCCCGGT TCTTTTTGTC AAGACCGACC 
AGGCAGCGCG GCTATCGTGG CTGGCCACGA 
TTGTCACTGA AGCGGGAAGG GACTGGCTGC 
TGTCATCTCA CCTTGCTCCT GCCGAGAAAG 
TGCATACGCT TGATCCGGCT ACCTGCCCAT 
GAGCACGTAC TCGGATGGAA GCCGGTCTTG 
AGGGGCTCGC GCCAGCCGAA CTGTTCGCCA 
ATCTCGTCGT GACCCATGGC GATGCCTGCT 
TTTCTGGATT CATCGACTGT GGCCGGCTGG 
TGGCTACCCG TGATATTGCT GAAGAGCTTG 
TTTACGGTAT CGCCGCTCCC GATTCGCAGC 
TCTTCTGAGC GGGACTCTGG GGTTCGAAAT 
ACGAGATTTC GATTCCACCG CCGCCTTCTA 
GGACGCCGGC TGGATGATCC TCCAGCGCGG 
CAACTTGTTT ATTGCAGCTT ATAATGGTTA 
AAATAAAGCA TTTTTTTCAC TGCATTCTAG 
TTATCATGTC TGTATACCGT CGACCTCTAG 
GTTTCCTGTG TGAAATTGTT ATCCGCTCAC 
AAAGTGTAAA GCCTGGGGTG CCTAATGAGT 
ACTGCCCGCT TTCCAGTCGG GAAACCTGTC 
CGCGGGGAGA GGCGGTTTGC GTATTGGGCG 
GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT 
ATCCACAGAA TCAGGGGATA ACGCAGGAAA 
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4501 GAACATGTGA GCAAAAGGCC AGCAAAAGGC 
4561 GTTTTTCCAT AGGCTCCGCC CCCCTGACGA 
4621 GTGGCGAAAC CCGACAGGAC TATAAAGATA 
4681 GCGCTCTCCT GTTCCGACCC TGCCGCTTAC 
4741 AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 
4801 CTCCAAGCTG GGCTGTGTGC ACGAACCCCC 
4861 TAACTATCGT CTTGAGTCCA ACCCGGTAAG 
4921 TGGTAACAGG ATTAGCAGAG CGAGGTATGT 
4 981 GCCTAACTAC GGCTACACTA GAAGGACAGT 
5041 TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 
5101 TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 
5161 TTTGATCTTT TCTACGGGGT CTGACGCTCA 
5221 GGTCATGAGA TTATCAAAAA GGATCTTCAC 
5281 TAAATCAATC TAAAGTATAT ATGAGTAAAC 
5341 TGAGGCACCT ATCTCAGCGA TCTGTCTATT 
5401 CGTGTAGATA ACTACGATAC GGGAGGGCTT 
54 61 GCGAGACCCA CGCTCACCGG CTCCAGATTT 
5521 CGAGCGCAGA AGTGGTCCTG CAACTTTATC 
5581 GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA 
5641 AGGCATCGTG GTGTCACGCT CGTCGTTTGG 
5701 ATCAAGGCGA GTTACATGAT CCCCCATGTT 
5761 TCCGATCGTT GTCAGAAGTA AGTTGGCCGC 
5821 GCATAATTCT CTTACTGTCA TGCCATCCGT 
5881 AACCAAGTCA TTCTGAGAAT AGTGTATGCG 
5941 ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 
6001 TTCGGGGCGA AAACTCTCAA GGATCTTACC 
6061 TCGTGCACCC AACTGATCTT CAGCATCTTT 
6121 AACAGGAAGG CAAAATGCCG CAAAAAAGGG 
6181 CATACTCTTC CTTTTTCAAT ATTATTGAAG 
6241 ATACATATTT GAATGTATTT AGAAAAATAA 
6301 AAAAGTGCCA CCTGACGTC 



CAGGAACCGT AAAAAGGCCG CGTTGCTGGC 
GCATCACAAA AATCGACGCT CAAGTCAGAG 
CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT 
CGGATACCTG TCCGCCTTTC TCCCTTCGGG 
TAGGTATCTC AGTTCGGTGT AGGTCGTTCG 
CGTTCAGCCC GACCGCTGCG CCTTATCCGG 
ACACGACTTA TCGCCACTGG CAGCAGCCAC 
AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG 
ATTTGGTATC TGCGCTCTGC TGAAGCCAGT 
ATCCGGCAAA CAAACCACCG CTGGTAGCGG 
GCGCAGAAAA AAAGGATCTC AAGAAGATCC 
GTGGAACGAA AACTCACGTT AAGGGATTTT 
CTAGATCCTT TTAAATTAAA AATGAAGTTT 
TTGGTCTGAC AGTTACCAAT GCTTAATCAG 
TCGTTCATCC ATAGTTGCCT GACTCCCCGT 
ACCATCTGGC CCCAGTGCTG CAATGATACC 
ATCAGCAATA AACCAGCCAG CCGGAAGGGC 
CGCCTCCATC CAGTCTATTA ATTGTTGCCG 
TAGTTTGCGC AACGTTGTTG CCATTGCTAC 
TATGGCTTCA TTCAGCTCCG GTTCCCAACG 
GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC 
AGTGTTATCA CTCATGGTTA TGGCAGCACT 
AAGATGCTTT TCTGTGACTG GTGAGTACTC 
GCGACCGAGT TGCTCTTGCC CGGCGTCAAT 
TTTAAAAGTG CTCATCATTG GAAAACGTTC 
GCTGTTGAGA TCCAGTTCGA TGTAACCCAC 
TACTTTCACC AGCGTTTCTG GGTGAGCAAA 
AATAAGGGCG ACACGGAAAT GTTGAATACT 
CATTTATCAG GGTTATTGTC TCATGAGCGG 
ACAAATAGGG GTTCCGCGCA CATTTCCCCG 
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CDS 



BASE COUNT 
ORIGIN 

1 
61 
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301 
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481 
541 
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661 
721 
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961 
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1081 
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pSBS5185-N 62 95 bp DNA CIRCULAR SYN 

Ligation of 5185 into NVF (Kpnl, BamHI) 

pSBS5185-N 

1 (bases 1 to 6295) 

Location/Qualifiers 

956. .1003 

/gene="NLS" 

/product = "Nuclear Localization Signal 1 

1004. .1573 

/gene="ZFP" 

/product="LSR 6A-5A" 

1574. .1816 

/gene="VP16" 

/product="VP16 activation domain" 

1817. .1843 

/gene="FLAG" 

/product= "FLAG epitope" 

3040. .3923 

/gene="Neo ,! 

/product= "neomycin resistance" 
complement (5297 . .6157) 
/gene="Amp " 

/products "Ampcillin resistance" 
1452 a 1682 c 1635 g 1526 t 



GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 
CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 
CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 
TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 
GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 
ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 
ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 
ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 
TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 
ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 
GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG 
ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA 
GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTAGC 
AAGCTGATCC ACTAGTCCAG TGTGGTGGAA TTCGCTAGCG CCACCATGGC 
AAGAGGAAGG TGGGAATCCA TGGGGTACCG GGCAAGAAGA AGCAGCACAT 
CAGGGCTGTG GTAAAGTTTA CGGCCGCTCC GACCACCTGG CCCGCCACCT 
ACCGGCGAGA GGCCTTTCAT GTGTACATGG TCCTACTGTG GTAAACGCTT 
GACGAGCTGC AGCGCCACAA GCGTACCCAC ACCGGTGAGA AGAAATTTGC 
TGTCCGAAGC GCTTCATGCG CTCCGACGAG CGCAAGCGCC ACATCAAGAC 
AAGAAGGGTG GATCTGGTGA TGGCAAGAAG AAGCAGCACA TCTGCCACAT 
GGTAAAGTTT ACGGCCGCTC CGACCACCTG ACCACCCACC TGCGCTGGCA 

GGTAAACGCT TCACCCGCTC 



ACTCACGGGG 
AAAATCAACG 
GTAGGCGTGT 
CTGCTTACTG 
GTTTAAACTT 
CCCCAAGAAG 
CTGCCACATC 
GCGCTGGCAC 
CACCCGCTCC 
1201 TTGTCCGGAA 
1261 CCACCAGAAC 
1321 CCAGGGCTGT 

1381 CACCGGCGAG AGGCCTTTCA TGTGTACATG GTCCTACTGT 



Figure 26L 
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1441 CGACCACCTG ACCCGCCACA AGCGTACCCA 
1501 ATGTCCGAAG CGCTTCATGC GCTCCGACCA 
1561 CAAGAAGGGT GGATCCGCCC CCCCGACCGA 
1621 CGGCGAGGAC GTGGCGATGG CGCATGCCGA 
1681 GGGGGACGGG GATTCCCCGG GGCCGGGATT 
1741 TCTGGATATG GCCGGCTTCG AGTTTGAGCA 
1801 GTACGGTGGG GGCAGCGACT ACAAGGACGA 
1861 GGCCCGTTTA AACCCGCTGA TCAGCCTCGA 
1921 TTTGCCCCTC CCCCGTGCCT TCCTTGACCC 
1981 AATAAAATGA GGAAATTGCA TCGCATTGTC 
2041 GGGTGGGGCA GGACAGCAAG GGGGAGGATT 
2101 CGGTGGGCTC TATGGCTTCT GAGGCGGAAA 
2161 ACGCGCCCTG TAGCGGCGCA TTAAGCGCGG 
2221 CTACACTTGC CAGCGCCCTA GCGCCCGCTC 
22 81 CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA 
2341 GTGCTTTACG GCACCTCGAC CCCAAAAAAC 
2401 CATCGCCCTG ATAGACGGTT TTTCGCCCTT 
24 61 GACTCTTGTT CCAAACTGGA ACAACACTCA 
2521 AAGGGATTTT GGGGATTTCG GCCTATTGGT 
2581 ACGCGAATTA ATTCTGTGGA ATGTGTGTCA 
2641 AGGCAGGCAG AAGTATGCAA AGCATGCATC 
2701 CCCCAGGCTC CCCAGCAGGC AGAAGTATGC 
2761 TAGTCCCGCC CCTAACTCCG CCCATCCCGC 
2821 CGCCCCATGG CTGACTAATT TTTTTTATTT 
2881 AGCTATTCCA GAAGTAGTGA GGAGGCTTTT 
2941 CGGGAGCTTG TATATCCATT TTCGGATCTG 
3001 TGATTGAACA AGATGGATTG CACGCAGGTT 
3061 GCTATGACTG GGCACAACAG ACAATCGGCT 
3121 CGCAGGGGCG CCCGGTTCTT TTTGTCAAGA 
3181 AGGACGAGGC AGCGCGGCTA TCGTGGCTGG 
3241 TCGACGTTGT CACTGAAGCG GGAAGGGACT 
3301 ATCTCCTGTC ATCTCACCTT GCTCCTGCCG 
3361 GGCGGCTGCA TACGCTTGAT CCGGCTACCT 
3421 TCGAGCGAGC ACGTACTCGG ATGGAAGCCG 
3481 AGCATCAGGG GCTCGCGCCA GCCGAACTGT 
3541 GCGAGGATCT CGTCGTGACC CATGGCGATG 
3601 GCCGCTTTTC TGGATTCATC GACTGTGGCC 
3661 TAGCGTTGGC TACCCGTGAT ATTGCTGAAG 
3721 TCGTGCTTTA CGGTATCGCC GCTCCCGATT 
3781 ACGAGTTCTT CTGAGCGGGA CTCTGGGGTT 
3841 GCCATCACGA GATTTCGATT CCACCGCCGC 
3901 TTTCCGGGAC GCCGGCTGGA TGATCCTCCA 

3 961 CCACCCCAAC TTGTTTATTG CAGCTTATAA 
4021 TTTCACAAAT AAAGCATTTT TTTCACTGCA 
4081 TGTATCTTAT CATGTCTGTA TACCGTCGAC 
4141 ATAGCTGTTT CCTGTGTGAA ATTGTTATCC 
4201 AAGCATAAAG TGTAAAGCCT GGGGTGCCTA 
4261 GCGCTCACTG CCCGCTTTCC AGTCGGGAAA 
4321 CCAACGCGCG GGGAGAGGCG GTTTGCGTAT 

4 381 CTCGCTGCGC TCGGTCGTTC GGCTGCGGCG 
4441 ACGGTTATCC ACAGAATCAG GGGATAACGC 



CACCGGTGAG AAGAAATTTG CTTGTCCGGA 
CCTGACCACC CACATCAAGA CCCACCAGAA 
TGTCAGCCTG GGGGACGAGC TCCACTTAGA 
CGCGCTAGAC GATTTCGATC TGGACATGTT 
TACCCCCCAC GACTCCGCCC CCTACGGCGC 
GATGTTTACC GATGCCCTTG GAATTGACGA 
CGATGACAAG TAAGCTTCTC GAGTCTAGAG 
CTGTGCCTTC TAGTTGCCAG CCATCTGTTG 
TGGAAGGTGC CACTCCCACT GTCCTTTCCT 
TGAGTAGGTG TCATTCTATT CTGGGGGGTG 
GGGAAGACAA TAGCAGGCAT GCTGGGGATG 
GAACCAGCTG GGGCTCTAGG GGGTATCCCC 
CGGGTGTGGT GGTTACGCGC AGCGTGACCG 
CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA 
ATCGGGGCAT CCCTTTAGGG TTCCGATTTA 
TTGATTAGGG TGATGGTTCA CGTAGTGGGC 
TGACGTTGGA GTCCACGTTC TTTAATAGTG 
ACCCTATCTC GGTCTATTCT TTTGATTTAT 
TAAAAAATGA GCTGATTTAA CAAAAATTTA 
GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC 
TCAATTAGTC AGCAACCAGG TGTGGAAAGT 
AAAGCATGCA TCTCAATTAG TCAGCAACCA 
CCCTAACTCC GCCCAGTTCC GCCCATTCTC 
ATGCAGAGGC CGAGGCCGCC TCTGCCTCTG 
TTGGAGGCCT AGGCTTTTGC AAAAAGCTCC 
ATCAAGAGAC AGGATGAGGA TCGTTTCGCA 
CTCCGGCCGC TTGGGTGGAG AGGCTATTCG 
GCTCTGATGC CGCCGTGTTC CGGCTGTCAG 
CCGACCTGTC CGGTGCCCTG AATGAACTGC 
CCACGACGGG CGTTCCTTGC GCAGCTGTGC 
GGCTGCTATT GGGCGAAGTG CCGGGGCAGG 
AGAAAGTATC CATCATGGCT GATGCAATGC 
GCCCATTCGA CCACCAAGCG AAACATCGCA 
GTCTTGTCGA TCAGGATGAT CTGGACGAAG 
TCGCCAGGCT CAAGGCGCGC ATGCCCGACG 
CCTGCTTGCC GAATATCATG GTGGAAAATG 
GGCTGGGTGT GGCGGACCGC TATCAGGACA 
AGCTTGGCGG CGAATGGGCT GACCGCTTCC 
CGCAGCGCAT CGCCTTCTAT CGCCTTCTTG 
CGAAATGACC GACCAAGCGA CGCCCAACCT 
CTTCTATGAA AGGTTGGGCT TCGGAATCGT 
GCGCGGGGAT CTCATGCTGG AGTTCTTCGC 
TGGTTACAAA TAAAGCAATA GCATCACAAA 
TTCTAGTTGT GGTTTGTCCA AACTCATCAA 
CTCTAGCTAG AGCTTGGCGT AATCATGGTC 
GCTCACAATT CCACACAACA TACGAGCCGG 
ATGAGTGAGC TAACTCACAT TAATTGCGTT 
CCTGTCGTGC CAGCTGCATT AATGAATCGG 
TGGGCGCTCT TCCGCTTCCT CGCTCACTGA 
AGCGGTATCA GCTCACTCAA AGGCGGTAAT 
AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA 
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4501 AAAGGCCAGG AACCGTAAAA AGGCCGCGTT 
4561 TGACGAGCAT CACAAAAATC GACGCTCAAG 
4621 AAGATACCAG GCGTTTCCCC CTGGAAGCTC 
4681 GCTTACCGGA TACCTGTCCG CCTTTCTCCC 
4741 ACGCTGTAGG TATCTCAGTT CGGTGTAGGT 
4801 ACCCCCCGTT CAGCCCGACC GCTGCGCCTT 
4861 GGTAAGACAC GACTTATCGC CACTGGCAGC 
4921 GTATGTAGGC GGTGCTACAG AGTTCTTGAA 
4 981 GACAGTATTT GGTATCTGCG CTCTGCTGAA 
5041 CTCTTGATCC GGCAAACAAA CCACCGCTGG 
5101 GATTACGCGC AGAAAAAAAG GATCTCAAGA 
5161 CGCTCAGTGG AACGAAAACT CACGTTAAGG 
5221 CTTCACCTAG ATCCTTTTAA ATTAAAAATG 
5281 GTAAACTTGG TCTGACAGTT ACCAATGCTT 
5341 TCTATTTCGT TCATCCATAG TTGCCTGACT 
5401 GGGCTTACCA TCTGGCCCCA GTGCTGCAAT 
5461 AGATTTATCA GCAATAAACC AGCCAGCCGG 
5521 TTTATCCGCC TCCATCCAGT CTATTAATTG 
5581 AGTTAATAGT TTGCGCAACG TTGTTGCCAT 
5641 GTTTGGTATG GCTTCATTCA GCTCCGGTTC 
5701 CATGTTGTGC AAAAAAGCGG TTAGCTCCTT 
5761 GGCCGCAGTG TTATCACTCA TGGTTATGGC 
5821 ATCCGTAAGA TGCTTTTCTG TGACTGGTGA 
5881 TATGCGGCGA CCGAGTTGCT CTTGCCCGGC 
5941 CAGAACTTTA AAAGTGCTCA TCATTGGAAA 
6001 CTTACCGCTG TTGAGATCCA GTTCGATGTA 
6061 ATCTTTTACT TTCACCAGCG TTTCTGGGTG 
6121 AAAGGGAATA AGGGCGACAC GGAAATGTTG 
6181 TTGAAGCATT TATCAGGGTT ATTGTCTCAT 
6241 AAATAAACAA ATAGGGGTTC CGCGCACATT 



GCTGGCGTTT TTCCATAGGC TCCGCCCCCC 
TCAGAGGTGG CGAAACCCGA CAGGACTATA 
CCTCGTGCGC TCTCCTGTTC CGACCCTGCC 
TTCGGGAAGC GTGGCGCTTT CTCAATGCTC 
CGTTCGCTCC AAGCTGGGCT GTGTGCACGA 
ATCCGGTAAC TATCGTCTTG AGTCCAACCC 
AGCCACTGGT AACAGGATTA GCAGAGCGAG 
GTGGTGGCCT AACTACGGCT ACACTAGAAG 
GCCAGTTACC TTCGGAAAAA GAGTTGGTAG 
TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA 
AGATCCTTTG ATCTTTTCTA CGGGGTCTGA 
GATTTTGGTC ATGAGATTAT CAAAAAGGAT 
AAGTTTTAAA TCAATCTAAA GTATATATGA 
AATCAGTGAG GCACCTATCT CAGCGATCTG 
CCCCGTCGTG TAGATAACTA CGATACGGGA 
GATACCGCGA GACCCACGCT CACCGGCTCC 
AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC 
TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC 
TGCTACAGGC ATCGTGGTGT CACGCTCGTC 
CCAACGATCA AGGCGAGTTA CATGATCCCC 
CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT 
AGCACTGCAT AATTCTCTTA CTGTCATGCC 
GTACTCAACC AAGTCATTCT GAGAATAGTG 
GTCAATACGG GATAATACCG CGCCACATAG 
ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT 
ACCCACTCGT GCACCCAACT GATCTTCAGC 
AGCAAAAACA GGAAGGCAAA ATGCCGCAAA 
AATACTCATA CTCTTCCTTT TTCAATATTA 
GAGCGGATAC ATATTTGAAT GTATTTAGAA 
TCCCCGAAAA GTGCCACCTG ACGTC 
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LOCUS pSBS5186-N 6319 bp DNA CIRCULAR SYN 

DEFINITION Ligation of 5186 into NVF (Kpnl, BamHI) 
ACCESSION pSBS5186-N 
REFERENCE 1 (bases 1 to 6319) 
FEATURES Locat ion/Quali f iers 

CDS 956.. 1003 

/gene="NLS" 

/product= "Nuclear Localization Signal" 
CDS 1004.. 1597 

/gene="ZFP" 

/product="LSR 8A-7B" 
CDS 1598.. 1840 

/gene="VPl6" 

/product*" VP16 activation domain" 
CDS 1841.. 1867 

/gene="FLAG" 

/product* "FLAG epitope" 
CDS 3064.. 3947 

/gene="Neo" 

/product* "neomycin resistance" 
CDS complement (5321.. 6181) 

/ gene = "Amp " 

/products "Ampci 11 in resistance" 
BASE COUNT 1449 a 1687 C 1651 g 1532 t 

ORIGIN 

1 GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 
61 CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 
121 CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 
181 TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 
241 GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
301 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
361 CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 
421 ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 
481 ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 
541 ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 
601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 
661 ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 
721 AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG 
781 GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA 
841 CTGCTTACTG GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTAGC 
901 GTTTAAACTT AAGCTGATCC ACTAGTCCAG TGTGGTGGAA TTCGCTAGCG CCACCATGGC 
961 CCCCAAGAAG AAGAGGAAGG TGGGAATCCA TGGGGTACCG GGCAAGAAGA AGCAGCACAT 
1021 CTGCCACATC CAGGGCTGTG GTAAAGTTTA CGGCCAGTCC GGCGCCCTGA CCCGCCACCT 
1081 GCGCTGGCAC ACCGGCGAGA GGCCTTTCAT GTGTACATGG TCCTACTGTG GTAAACGCTT 
1141 CACCCGCTCC GACCACCTGA CCCGCCACAA GCGTACCCAC ACCGGTGAGA AGAAATTTGC 
1201 TTGTCCGGAA TGTCCGAAGC GCTTCATGCG CTCCGACAAC CTGCGCGAGC ACAACAAGAC 
1261 CCACCAGAAC AAGAAGGGTG GATCTGGTGA TGGTGGCCGT CGCGGTGGCG GTTCTGGCAA 
1321 GAAGAAGCAG CACATCTGCC ACATCCAGGG CTGTGGTAAA GTTTACGGCC GCTCCTCCGC 



Figure 260 



WO 01/21647 



PCT/IB00/01470 



41/54 



13 81 CCTGACCCGC CACCTGCGCT GGCACACCGG 
1441 CTGTGGTAAA CGCTTCACCC AGCGCGCCCA 
1501 TGAGAAGAAA TTTGCTTGTC CGGAATGTCC 
1561 CGAGCACATC AAGACCCACC AGAACAAGAA 
1621 CCTGGGGGAC GAGCTCCACT TAGACGGCGA 
1681 AGACGATTTC GATCTGGACA TGTTGGGGGA 
1741 CCACGACTCC GCCCCCTACG GCGCTCTGGA 
1801 TACCGATGCC CTTGGAATTG ACGAGTACGG 
1861 CAAGTAAGCT TCTCGAGTCT AGAGGGCCCG 
1921 CTTCTAGTTG CCAGCCATCT GTTGTTTGCC 
1981 GTGCCACTCC CACTGTCCTT TCCTAATAAA 
2041 GGTGTCATTC TATTCTGGGG GGTGGGGTGG 
2101 ACAATAGCAG GCATGCTGGG GATGCGGTGG 
2161 GCTGGGGCTC TAGGGGGTAT CCCCACGCGC 
2221 TGGTGGTTAC GCGCAGCGTG ACCGCTACAC 
2281 CTTTCTTCCC TTCCTTTCTC GCCACGTTCG 
2341 GC ATCCCTTT AGGGTTCCGA TTTAGTGCTT 
24 01 AGGGTGATGG TTCACGTAGT GGGCCATCGC 
2461 TGGAGTCCAC GTTCTTTAAT AGTGGACTCT 
2521 TCTCGGTCTA TTCTTTTGAT TTATAAGGGA 
2581 ATGAGCTGAT TTAACAAAAA TTTAACGCGA 
2641 GTGTGGAAAG TCCCCAGGCT CCCCAGGCAG 
2701 AGTCAGCAAC CAGGTGTGGA AAGTCCCCAG 
2761 TGCATCTCAA TTAGTCAGCA ACCATAGTCC 
2821 CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC 
2881 AGGCCGAGGC CGCCTCTGCC TCTGAGCTAT 
2941 GCCTAGGCTT TTGCAAAAAG CTCCCGGGAG 
3001 AGACAGGATG AGGATCGTTT CGCATGATTG 
3061 CCGCTTGGGT GGAGAGGCTA TTCGGCTATG 
3121 ATGCCGCCGT GTTCCGGCTG TCAGCGCAGG 
3181 TGTCCGGTGC CCTGAATGAA CTGCAGGACG 
3241 CGGGCGTTCC TTGCGCAGCT GTGCTCGACG 
3301 TATTGGGCGA AGTGCCGGGG CAGGATCTCC 
3361 TATCCATCAT GGCTGATGCA ATGCGGCGGC 
3421 TCGACCACCA AGCGAAACAT CGCATCGAGC 
3481 TCGATCAGGA TGATCTGGAC GAAGAGCATC 
3541 GGCTCAAGGC GCGCATGCCC GACGGCGAGG 
3601 TGCCGAATAT CATGGTGGAA AATGGCCGCT 
3661 GTGTGGCGGA CCGCTATCAG GACATAGCGT 
3721 GCGGCGAATG GGCTGACCGC TTCCTCGTGC 
3781 GCATCGCCTT CTATCGCCTT CTTGACGAGT 
3841 GACCGACCAA GCGACGCCCA ACCTGCCATC 
3901 TGAAAGGTTG GGCTTCGGAA TCGTTTTCCG 
3961 GGATCTCATG CTGGAGTTCT TCGCCCACCC 
4021 CAAATAAAGC AATAGCATCA CAAATTTCAC 
4081 TTGTGGTTTG TCCAAACTCA TCAATGTATC 
4141 CTAGAGCTTG GCGTAATCAT GGTCATAGCT 
4201 AATTCCACAC AACATACGAG CCGGAAGCAT 
4261 GAGCTAACTC ACATTAATTG CGTTGCGCTC 
4321 GTGCCAGCTG CATTAATGAA TCGGCCAACG 



CGAGAGGCCT TTCATGTGTA CATGGTCCTA 
CCTGGAGCGC CACAAGCGTA CCCACACCGG 
GAAGCGCTTC ATGCGCTCCG ACACCCTGCG 
GGGTGGATCC GCCCCCCCGA CCGATGTCAG 
GGACGTGGCG ATGGCGCATG CCGACGCGCT 
CGGGGATTCC CCGGGGCCGG GATTTACCCC 
TATGGCCGGC TTCGAGTTTG AGCAGATGTT 
TGGGGGCAGC GACTACAAGG ACGACGATGA 
TTTAAACCCG CTGATCAGCC TCGACTGTGC 
CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 
ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 
GGCAGGACAG CAAGGGGGAG GATTGGGAAG 
GCTCTATGGC TTCTGAGGCG GAAAGAACCA 
CCTGTAGCGG CGCATTAAGC GCGGCGGGTG 
TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG 
CCGGCTTTCC CCGTCAAGCT CTAAATCGGG 
TACGGCACCT CGACCCCAAA AAACTTGATT 
CCTGATAGAC GGTTTTTCGC CCTTTGACGT 
TGTTCCAAAC TGGAACAACA CTCAACCCTA 
TTTTGGGGAT TTCGGCCTAT TGGTTAAAAA 
ATTAATTCTG TGGAATGTGT GTCAGTTAGG 
GCAGAAGTAT GCAAAGCATG CATCTCAATT 
GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA 
CGCCCCTAAC TCCGCCCATC CCGCCCCTAA 
ATGGCTGACT AATTTTTTTT ATTTATGCAG 
TCCAGAAGTA GTGAGGAGGC TTTTTTGGAG 
CTTGTATATC CATTTTCGGA TCTGATCAAG 
AACAAGATGG ATTGCACGCA GGTTCTCCGG 
ACTGGGCACA ACAGACAATC GGCTGCTCTG 
GGCGCCCGGT TCTTTTTGTC AAGACCGACC 
AGGCAGCGCG GCTATCGTGG CTGGCCACGA 
TTGTCACTGA AGCGGGAAGG GACTGGCTGC 
TGTCATCTCA CCTTGCTCCT GCCGAGAAAG 
TGCATACGCT TGATCCGGCT ACCTGCCCAT 
GAGCACGTAC TCGGATGGAA GCCGGTCTTG 
AGGGGCTCGC GCCAGCCGAA CTGTTCGCCA 
ATCTCGTCGT GACCCATGGC GATGCCTGCT 
TTTCTGGATT CATCGACTGT GGCCGGCTGG 
TGGCTACCCG TGATATTGCT GAAGAGCTTG 
TTTACGGTAT CGCCGCTCCC GATTCGCAGC 
TCTTCTGAGC GGGACTCTGG GGTTCGAAAT 
ACGAGATTTC GATTCCACCG CCGCCTTCTA 
GGACGCCGGC TGGATGATCC TCCAGCGCGG 
CAACTTGTTT ATTGCAGCTT ATAATGGTTA 
AAATAAAGCA TTTTTTTCAC TGCATTCTAG 
TTATCATGTC TGTATACCGT CGACCTCTAG 
GTTTCCTGTG TGAAATTGTT ATCCGCTCAC 
AAAGTGTAAA GCCTGGGGTG CCTAATGAGT 
ACTGCCCGCT TTCCAGTCGG GAAACCTGTC 
CGCGGGGAGA GGCGGTTTGC GTATTGGGCG 
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4381 CTCTTCCGCT TCCTCGCTCA CTGACTCGCT 
4441 ATCAGCTCAC TCAAAGGCGG TAATACGGTT 
4501 GAACATGTGA GCAAAAGGCC AGCAAAAGGC 
4561 GTTTTTCCAT AGGCTCCGCC CCCCTGACGA 
4 621 GTGGCGAAAC CCGACAGGAC TATAAAGATA 
4 681 GCGCTCTCCT GTTCCGACCC TGCCGCTTAC 
4741 AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 
4 801 CTCCAAGCTG GGCTGTGTGC ACGAACCCCC 
4 861 TAACTATCGT CTTGAGTCCA ACCCGGTAAG 
4921 TGGTAACAGG ATTAGCAGAG CGAGGTATGT 
4981 GCCTAACTAC GGCTACACTA GAAGGACAGT 
5041 TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 
5101 TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 
5161 TTTGATCTTT TCTACGGGGT CTGACGCTCA 
5221 GGTCATGAGA TTATCAAAAA GGATCTTCAC 
5281 TAAATCAATC TAAAGTATAT ATGAGTAAAC 
5341 TGAGGCACCT ATCTCAGCGA TCTGTCTATT 
5401 CGTGTAGATA ACTACGATAC GGGAGGGCTT 
5461 GCGAGACCCA CGCTCACCGG CTCCAGATTT 
5521 CGAGCGCAGA AGTGGTCCTG CAACTTTATC 
5581 GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA 
5641 AGGCATCGTG GTGTCACGCT CGTCGTTTGG 
5701. ATCAAGGCGA GTTACATGAT CCCCCATGTT 
5761 TCCGATCGTT GTCAGAAGTA AGTTGGCCGC 
5821 GCATAATTCT CTTACTGTCA TGCCATCCGT 
5881 AACCAAGTCA TTCTGAGAAT AGTGTATGCG 
5941 ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 
6001 TTCGGGGCGA AAACTCTCAA GGATCTTACC 
6061 TCGTGCACCC AACTGATCTT CAGCATCTTT 
6121 AACAGGAAGG CAAAATGCCG CAAAAAAGGG 
6181 CATACTCTTC CTTTTTCAAT ATTATTGAAG 
6241 ATACATATTT GAATGTATTT AGAAAAATAA 
6301 AAAAGTGCCA CCTGACGTC 



GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT 
ATCCACAGAA TCAGGGGATA ACGCAGGAAA 
CAGGAACCGT AAAAAGGCCG CGTTGCTGGC 
GCATCACAAA AATCGACGCT CAAGTCAGAG 
CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT 
CGGATACCTG TCCGCCTTTC TCCCTTCGGG 
TAGGTATCTC AGTTCGGTGT AGGTCGTTCG 
CGTTCAGCCC GACCGCTGCG CCTTATCCGG 
ACACGACTTA TCGCCACTGG CAGCAGCCAC 
AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG 
ATTTGGTATC TGCGCTCTGC TGAAGCCAGT 
ATCCGGCAAA CAAACCACCG CTGGTAGCGG 
GCGCAGAAAA AAAGGATCTC AAGAAGATCC 
GTGGAACGAA AACTCACGTT AAGGGATTTT 
CTAGATCCTT TTAAATTAAA AATGAAGTTT 
TTGGTCTGAC AGTTACCAAT GCTTAATCAG 
TCGTTCATCC ATAGTTGCCT GACTCCCCGT 
ACCATCTGGC CCCAGTGCTG CAATGATACC 
ATCAGCAATA AACCAGCCAG CCGGAAGGGC 
CGCCTCCATC CAGTCTATTA ATTGTTGCCG 
TAGTTTGCGC AACGTTGTTG CCATTGCTAC 
TATGGCTTCA TTCAGCTCCG GTTCCCAACG 
GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC 
AGTGTTATCA CTCATGGTTA TGGCAGCACT 
AAGATGCTTT TCTGTGACTG GTGAGTACTC 
GCGACCGAGT TGCTCTTGCC CGGCGTCAAT 
TTTAAAAGTG CTCATCATTG GAAAACGTTC 
GCTGTTGAGA TCCAGTTCGA TGTAACCCAC 
TACTTTCACC AGCGTTTCTG GGTGAGCAAA 
AATAAGGGCG ACACGGAAAT GTTGAATACT 
CATTTATCAG GGTTATTGTC TCATGAGCGG 
ACAAATAGGG GTTCCGCGCA CATTTCCCCG 
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LOCUS pSBS5205-N 6295 bp DNA CIRCULAR SYN 

DEFINITION Ligation of 5205 into NVF (Kpnl, BamHI) 
ACCESSION pSBS5205-N 
REFERENCE 1 (bases 1 to 6295) 
FEATURES Loc a t ion/Qua 1 i f i er s 

CDS 956.. 1003 

/gene="NLS" 

/product="Nuclear Localization Signal" 
CDS 1004.. 1573 

/gene="ZFP" 

/products "LSR 1A-7B" 
CDS 1574.. 1816 

/gene="VP16" 

/products "VP16 activation domain" 
CDS 1817.. 1843 

/gene="FLAG" 

/products" FLAG epitope" 
CDS 3040.. 3923 

/gene="Neo" 

/products" neomycin resistance" 
CDS complement (5297.. 6157) 

/gene* "Amp " 

/products "Ampcillin resistance" 
BASE COUNT 1448 a 1677 c 1643 g 1527 t 

ORIGIN 

1 GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 
61 CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 
121 CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 
181 TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 
241 GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
301 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
361 CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 
421 ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 
481 ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 
541 ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 
601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 
661 ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 
721 AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG 
781 GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA 
841 CTGCTTACTG GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTAGC 
901 GTTTAAACTT AAGCTGATCC ACTAGTCCAG TGTGGTGGAA TTCGCTAGCG CCACCATGGC 
961 CCCCAAGAAG AAGAGGAAGG TGGGAATCCA TGGGGTACCG GGCAAGAAGA AGCAGCACAT 
1021 CTGCCACATC CAGGGCTGTG GTAAAGTTTA CGGCGAGCGC GGCGACCTGA CCCGCCACCT 
1081 GCGCTGGCAC ACCGGCGAGA GGCCTTTCAT GTGTACATGG TCCTACTGTG GTAAACGCTT 
1141 CACCGACCCG GGCGCCCTGG TGCGCCACAA GCGTACCCAC ACCGGTGAGA AGAAATTTGC 
1201 TTGTCCGGAA TGTCCGAAGC GCTTCATGCG CTCCGACAAC CTGACCCAGC ACATCAAGAC 
1261 CCACCAGAAC AAGAAGGGTG GATCTGGTGA TGGCAAGAAG AAGCAGCACA TCTGCCACAT 
1321 CCAGGGCTGT GGTAAAGTTT ACGGCCAGTC CGGCACCCTG ACCCGCCACC TGCGCTGGCA 
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1381 CACCGGCGAG AGGCCTTTCA TGTGTACATG GTCCTACTGT GGTAAACGCT TCACCCAGTC 
1441 CTCCGACCTG CAGCGCCACA AGCGTACCCA CACCGGTGAG AAGAAATTTG CTTGTCCGGA 
1501 ATGTCCGAAG CGCTTCATGC GCTCCGACGC CCTGGCCCGC CACATCAAGA CCCACCAGAA 
1561 CAAGAAGGGT GGATCCGCCC CCCCGACCGA TGTCAGCCTG GGGGACGAGC TCCACTTAGA 
1621 CGGCGAGGAC GTGGCGATGG CGCATGCCGA CGCGCTAGAC GATTTCGATC TGGACATGTT 
1681 GGGGGACGGG GATTCCCCGG GGCCGGGATT TACCCCCCAC GACTCCGCCC CCTACGGCGC 
1741 TCTGGATATG GCCGGCTTCG AGTTTGAGCA GATGTTTACC GATGCCCTTG GAATTGACGA 
1801 GTACGGTGGG GGCAGCGACT ACAAGGACGA CGATGACAAG TAAGCTTCTC GAGTCTAGAG 
1861 GGCCCGTTTA AACCCGCTGA TCAGCCTCGA CTGTGCCTTC TAGTTGCCAG CCATCTGTTG 
1921 TTTGCCCCTC CCCCGTGCCT TCCTTGACCC TGGAAGGTGC CACTCCCACT GTCCTTTCCT 
1981 AATAAAATGA GGAAATTGCA TCGCATTGTC TGAGTAGGTG TCATTCTATT CTGGGGGGTG 
2041 GGGTGGGGCA GGACAGCAAG GGGGAGGATT GGGAAGACAA TAGCAGGCAT GCTGGGGATG 
2101 CGGTGGGCTC TATGGCTTCT GAGGCGGAAA GAACCAGCTG GGGCTCTAGG GGGTATCCCC 
2161 ACGCGCCCTG TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG 
2221 CTACACTTGC CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA 
2281 CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGCAT CCCTTTAGGG TTCCGATTTA 
2341 GTGCTTTACG GCACCTCGAC CCCAAAAAAC TTGATTAGGG TGATGGTTCA CGTAGTGGGC 
24 01 CATCGCCCTG ATAGACGGTT TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAGTG 
2461 GACTCTTGTT CCAAACTGGA ACAACACTCA ACCCTATCTC GGTCTATTCT TTTGATTTAT 
2521 AAGGGATTTT GGGGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA CAAAAATTTA 
2581 ACGCGAATTA ATTCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC 
2641 AGGCAGGCAG AAGTATGCAA AGCATGCATC TCAATTAGTC AGCAACCAGG TGTGGAAAGT 
2701 CCCCAGGCTC. CCCAGCAGGC AGAAGTATGC AAAGCATGCA TCTCAATTAG TCAGCAACCA 
2761 TAGTCCCGCC CCTAACTCCG CCCATCCCGC CCCTAACTCC GCCCAGTTCC GCCCATTCTC 
2821 CGCCCCATGG CTGACTAATT TTTTTTATTT ATGCAGAGGC CGAGGCCGCC TCTGCCTCTG 
2881 AGCTATTCCA GAAGTAGTGA GGAGGCTTTT TTGGAGGCCT AGGCTTTTGC AAAAAGCTCC 
2941 CGGGAGCTTG TATATCCATT TTCGGATCTG ATCAAGAGAC AGGATGAGGA TCGTTTCGCA 
3001 TGATTGAACA AGATGGATTG CACGCAGGTT CTCCGGCCGC TTGGGTGGAG AGGCTATTCG 
3061 GCTATGACTG GGCACAACAG ACAATCGGCT GCTCTGATGC CGCCGTGTTC CGGCTGTCAG 
3121 CGCAGGGGCG CCCGGTTCTT TTTGTCAAGA CCGACCTGTC CGGTGCCCTG AATGAACTGC 
3181 AGGACGAGGC AGCGCGGCTA TCGTGGCTGG CCACGACGGG CGTTCCTTGC GCAGCTGTGC 
3241 TCGACGTTGT CACTGAAGCG GGAAGGGACT GGCTGCTATT GGGCGAAGTG CCGGGGCAGG 
3301 ATCTCCTGTC ATCTCACCTT GCTCCTGCCG AGAAAGTATC CATCATGGCT GATGCAATGC 
3361 GGCGGCTGCA TACGCTTGAT CCGGCTACCT GCCCATTCGA CCACCAAGCG AAACATCGCA 
3421 TCGAGCGAGC ACGTACTCGG ATGGAAGCCG GTCTTGTCGA TCAGGATGAT CTGGACGAAG 
3481 AGCATCAGGG GCTCGCGCCA GCCGAACTGT TCGCCAGGCT CAAGGCGCGC ATGCCCGACG 
3541 GCGAGGATCT CGTCGTGACC CATGGCGATG CCTGCTTGCC GAATATCATG GTGGAAAATG 
3601 GCCGCTTTTC TGGATTCATC GACTGTGGCC GGCTGGGTGT GGCGGACCGC TATCAGGACA 
3661 TAGCGTTGGC TACCCGTGAT ATTGCTGAAG AGCTTGGCGG CGAATGGGCT GACCGCTTCC 
3721 TCGTGCTTTA CGGTATCGCC GCTCCCGATT CGCAGCGCAT CGCCTTCTAT CGCCTTCTTG 
3781 ACGAGTTCTT CTGAGCGGGA CTCTGGGGTT CGAAATGACC GACCAAGCGA CGCCCAACCT 
3841 GCCATCACGA GATTTCGATT CCACCGCCGC CTTCTATGAA AGGTTGGGCT TCGGAATCGT 
3901 TTTCCGGGAC GCCGGCTGGA TGATCCTCCA GCGCGGGGAT CTCATGCTGG AGTTCTTCGC 
3961 CCACCCCAAC TTGTTTATTG CAGCTTATAA TGGTTACAAA TAAAGCAATA GCATCACAAA 
4 021 TTTCACAAAT AAAGCATTTT TTTCACTGCA TTCTAGTTGT GGTTTGTCCA AACTCATCAA 
4081 TGTATCTTAT CATGTCTGTA TACCGTCGAC CTCTAGCTAG AGCTTGGCGT AATCATGGTC 
4141 ATAGCTGTTT CCTGTGTGAA ATTGTTATCC GCTCACAATT CCACACAACA TACGAGCCGG 
4201 AAGCATAAAG TGTAAAGCCT GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT 
4261 GCGCTCACTG CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG 
4321 CCAACGCGCG GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA 
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4381 CTCGCTGCGC TCGGTCGTTC GGCTGCGGCG 
4441 ACGGTTATCC ACAGAATCAG GGGATAACGC 
4501 AAAGGCCAGG AACCGTAAAA AGGCCGCGTT 
4561 TGACGAGCAT CACAAAAATC GACGCTCAAG 
4621 AAGATACCAG GCGTTTCCCC CTGGAAGCTC 
4681 GCTTACCGGA TACCTGTCCG CCTTTCTCCC 
4741 ACGCTGTAGG TATCTCAGTT CGGTGTAGGT 
4801 ACCCCCCGTT CAGCCCGACC GCTGCGCCTT 
4861 GGTAAGACAC GACTTATCGC CACTGGCAGC 
4921 GTATGTAGGC GGTGCTACAG AGTTCTTGAA 
4 981 GACAGTATTT GGTATCTGCG CTCTGCTGAA 
5041 CTCTTGATCC GGCAAACAAA CCACCGCTGG 
5101 GATTACGCGC AGAAAAAAAG GATCTCAAGA 
5161 CGCTCAGTGG AACGAAAACT CACGTTAAGG 
5221 CTTCACCTAG ATCCTTTTAA ATTAAAAATG 
5281 GTAAACTTGG TCTGACAGTT ACCAATGCTT 
5341 TCTATTTCGT TCATCCATAG TTGCCTGACT 
5401 GGGCTTACCA TCTGGCCCCA GTGCTGCAAT 
5461 AGATTTATCA GCAATAAACC AGCCAGCCGG 
5521 TTTATCCGCC TCCATCCAGT CTATTAATTG 
5581 AGTTAATAGT TTGCGCAACG TTGTTGCCAT 
5641 GTTTGGTATG GCTTCATTCA GCTCCGGTTC 
5701 CATGTTGTGC AAAAAAGCGG TTAGCTCCTT 
5761 GGCCGCAGTG TTATCACTCA TGGTTATGGC 
5821 ATCCGTAAGA TGCTTTTCTG TGACTGGTGA 
5881 TATGCGGCGA CCGAGTTGCT CTTGCCCGGC 
5941 CAGAACTTTA AAAGTGCTCA TCATTGGAAA 
6001 CTTACCGCTG TTGAGATCCA GTTCGATGTA 
6061 ATCTTTTACT TTCACCAGCG TTTCTGGGTG 
6121 AAAGGGAATA AGGGCGACAC GGAAATGTTG 
6181 TTGAAGCATT TATCAGGGTT ATTGTCTCAT 
6241 AAATAAACAA ATAGGGGTTC CGCGCACATT 



AGCGGTATCA GCTCACTCAA AGGCGGTAAT 
AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA 
GCTGGCGTTT TTCCATAGGC TCCGCCCCCC 
TCAGAGGTGG CGAAACCCGA CAGGACTATA 
CCTCGTGCGC TCTCCTGTTC CGACCCTGCC 
TTCGGGAAGC GTGGCGCTTT CTCAATGCTC 
CGTTCGCTCC AAGCTGGGCT GTGTGCACGA 
ATCCGGTAAC TATCGTCTTG AGTCCAACCC 
AGCCACTGGT AACAGGATTA GCAGAGCGAG 
GTGGTGGCCT AACTACGGCT ACACTAGAAG 
GCCAGTTACC TTCGGAAAAA GAGTTGGTAG 
TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA 
AGATCCTTTG ATCTTTTCTA CGGGGTCTGA 
GATTTTGGTC ATGAGATTAT CAAAAAGGAT 
AAGTTTTAAA TCAATCTAAA GTATATATGA 
AATCAGTGAG GCACCTATCT CAGCGATCTG 
CCCCGTCGTG TAGATAACTA CGATACGGGA 
GATACCGCGA GACCCACGCT CACCGGCTCC 
AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC 
TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC 
TGCTACAGGC ATCGTGGTGT CACGCTCGTC 
CCAACGATCA AGGCGAGTTA CATGATCCCC 
CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT 
AGCACTGCAT AATTCTCTTA CTGTCATGCC 
GTACTCAACC AAGTCATTCT GAGAATAGTG 
GTCAATACGG GATAATACCG CGCCACATAG 
ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT 
ACCCACTCGT GCACCCAACT GATCTTCAGC 
AGCAAAAACA GGAAGGCAAA ATGCCGCAAA 
AATACTCATA CTCTTCCTTT TTCAATATTA 
GAGCGGATAC ATATTTGAAT GTATTTAGAA 
TCCCCGAAAA GTGCCACCTG ACGTC 
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SEQUENCE LISTING 



<110> Genset 

<120> Methods of Screening for Compounds that Modulate the 

LSR-Leptin Interaction and Their Use in the Prevention 
and Treatment of Obesity-Related Diseases 

<130> 70.WO1 

<150> 60/155,506 
<151> 1999-09-22 

<160> 106 

<170> Patent. pm 

<210> 1 
<211> 23187 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> exon 
<222> 2001. .2356 
<223> exonl 

<220> 

<221> exon 
<222> 3540. .3884 
<223> exon2 

<220> 

<221> exon 

<222> 12163. .12282 

<223> exon3 

<220> 

<221> exon 

<222> 15144 . .15200 

<223> exon4 

<220> 

<221> exon 

<222> 15765. .15911 

<223> exonS 

<220> 

<221> exon 

<222> 19579. .19752 

<223> exon6 

<220> 

<221> exon 

<222> 19899. .19958 

<223> exon7 

<220> 

<221> exon 

<222> 20056. .20187 

<223> exon8 
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<220> 

<221> exon 

<222> 20329. .20957 

<223> exon9 

<220> 

<221> exon 

<222> 21047. .21187 

<223> exonlO 

<220> 

<221> polyA_signal 
<222> 21168. .21173 
<223> AATAAA 

<220> 

<221> misc_feature 
<222> 1. .2000 

<223> potential 5 ■ regulatory region 
<220> 

<221> misc_feature 
<222> 22324. .23187 

<223> homology with USF2 gene in ref: embl Y07661 
<220> 

<221> primer_bind 
<222> 523. .544 

<223> upstream amplification primer 17-2 
<220> 

<221> primer_bind 
<222> 1047. .1068 

<223> downstream amplification primer 17-2 , complement 
<220> 

<221> prime r_bind 
<222> 946. .963 

<223> upstream amplification primer 99-4576 
<220> 

<221> primer_bind 
<222> 1385. .1402 

<223> downstream amplification primer 99-4576 , complement 
<220> 

<221> prime r_bind 
<222> 1096. .1115 

<223> upstream amplification primer 9-19 
<220> 

<221> primer_bind 
<222> 1616. .1635 

<223> downstream amplification primer 9-19 , complement 
<220> 

<221> prime rjbind 
<222> 1602. .1621 

<223> upstream amplification primer 9-20 
<220> 

<221> prime r__bind 
<222> 2074. .2093 
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<223> downstream amplification primer 9-20 , complement 
<220> 

<221> primer_bind 
<222> 2036. .2053 

<223> upstream amplification primer 99-4557 
<220> 

<221> primer_bind 
<222> 2563. .2580 

<223> downstream amplification primer 99-4557 , complement 
<220> 

<221> primer_bind 
<222> 2084 . .2102 

<223> upstream amplification primer 9-1 
<220> 

<221> primerjbind 
<222> 2483 . .2500 

<223> downstream amplification primer 9-1 , complement 
<220> 

<221> primer_bind 
<222> 2470. .2489 

<223> upstream amplification primer 9-21 , complement 
<220> 

<221> primer_bind 
<222> 2062 . .2081 

<223> downstream amplification primer 9-21 
<220> 

<221> primerjbind 
<222> 3455. .3474 

<223> upstream amplification primer 9-3 
<220> 

<221> primer_bind 
<222> 3882. .3901 

<223> downstream amplification primer 9-3 , complement 
<220> 

<221> primerjbind 
<222> 3775. .3792 

<223> upstream amplification primer 99-4558 
<220> 

<221> primerjbind 
<222> 4336. .4356 

<223> downstream amplification primer 99-4558 , complement 
<220> 

<221> primer_bind 
<222> 4902. .4920 

<223> upstream amplification primer 99-14419 , complement 
<220> 

<221> primerjbind 
<222> 4444. .4463 

<223> downstream amplification primer 99-14419 



<220> 
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<221> prime rjbind 
<222> 6638. .6655 

<223> upstream amplification primer 99-4577 
<220> 

<221> primer_bind 
<222> 7072. .7089 

<223> downstream amplification primer 99-4577 , complement 
<220> 

<22l> primer_bind 
<222> 7995. .8012 

<223> upstream amplification primer 99-4559 
<220> 

<221> primer_bind 
<222> 8576. .8593 

<223> downstream amplification primer 99-4559 , complement 
<220> 

<221> primerjbind 
<222> 9622. .9639 

<223> upstream amplification primer 99-3148 
<220> 

<221> primer_bind 
<222> 10023. .10040 

<223> downstream amplification primer 99-3148 , complement 
<220> 

<221> primer_bind 
<222> 9964 . .9981 

<223> upstream amplification primer 99-4560 
<220> 

<221> primer_bind 
<222> 10546. .10563 

<223> downstream amplification primer 99-4560 , complement 
<220> 

<221> prime r_bind 
<222> 10996. .11015 

<223> upstream amplification primer 99-14411 , complement 
<220> 

<221> prime rjbind 
<222> 10492 . .10512 

<223> downstream amplification primer 99-14411 
<220> 

<221> primer_bind 
<222> 11972 . . 11990 

<223> upstream amplification primer 99-4561 
<220> 

<221> primer_bind 
<222> 12481. .12501 

<223> downstream amplification primer 99-4561 , complement 
<220> 

<221> primer_bind 
<222> 12005. .12023 

<223> upstream amplification primer 9-4 
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<220> 

<221> primer_bind 
<222> 12417. Tl2436 

<223> downstream amplification primer 9-4 , complement 
<220> 

<22l> primerjbind 
<222> 14102. .14119 

<223> upstream amplification primer 99-4562 
<220> 

<221> prime r_bind 
<222> 14543. .14563 

<223> dovmstream amplification primer 99-4562 , complement 
<220> 

<221> primer_bind 
<222> 14431. .14448 

<223> upstream amplification primer 99-3149 
<220> 

<221> primer_bind 
<222> 14848. .14865 

<223> downstream amplification primer 99-314 9 , complement 
<220> 

<221> primer_bind 
<222> 14748. .14767 

<223> upstream amplification primer 9-22 
<220> 

<221> primer_Jbind 
<222> 15198. .15218 

<223> downstream amplification primer 9-22 , complement 
<220> 

<221> prime r_bind 
<222> 14748. . 14767 

<223> upstream amplification primer 9-24 
<220> 

<221> prime r_bind 
<222> 15333 . .15351 

<223> downstream amplification primer 9-24 , complement 
<220> 

<221> primerjbind 
<222> 15002. .15019 

<223> upstream amplification primer 9-5 
<220> 

<221> primerjbind 
<222> 15333. .15351 

<223> downstream amplification primer 9-5 , complement 
<220> 

<221> primer_bind 
<222> 15640. .15657 

<223> upstream amplification primer 9-6 
<220> 

<221> primerjbind 
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<222> 16072. .16089 

<223> dovmstream amplification primer 9-6 , complement 
<220> 

<221> primer_bind 
<222> 15800. .15817 

<223> upstream amplification primer 99-4563 
<220> 

<221> primer_bind 
<222> 16179. Tl6199 

<223> downstream amplification primer 99-4563 , complement 
<220> 

<221> prime r_bind 
<222> 19295. .19312 

<223> upstream amplification primer 99-3150 
<220> 

<221> primer_bind 
<222> 19729. .19746 

<223> downstream amplification primer 99-3150 , complement 
<220> 

<221> primer_bind 
<222> 19420. .19438 

<223> upstream amplification primer 9-7 
<220> 

<221> primer_bind 
<222> 19824. .19841 

<223> downstream amplification primer 9-7 , complement 
<220> 

<221> primer_bind 
<222> 19798. .19815 

<223> upstream amplification primer 9-8 
<220> 

<221> primer_bind 
<222> 20137. .20155 

<223> downstream amplification primer 9-8 , complement 
<220> 

<221> primer_bind 
<222> 19913. .19931 

<223> upstream amplification primer 9-9 
<220> 

<221> prime r_bind 
<222> 20329. .20346 

<223> downstream amplification primer 9-9 , complement 
<220> 

<221> primer_bind 
<222> 20139. .20157 

<223> upstream amplification primer 99-4564 
<220> 

<221> primer_bind 
<222> 20582 . .20599 

<223> downstream amplification primer 99-4564 , complement 
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<220> 

<221> primerjbind 
<222> 20238. .20256 

<223> upstream amplification primer 9-10 
<220> 

<221> primer_bind 
<222> 20645. .20662 

<223> downstream amplification primer 9-10 , complement 
<220> 

<221> primer Jbind 
<222> 20410. .20424 

<223> upstream amplification primer 9-26 
<220> 

<221> primerjbind 
<222> 20690. .20706 

<223> downstream amplification primer 9-26 , complement 
<220> 

<221> primer_bind 
<222> 20569. .20588 

<223> upstream amplification primer 9-23 
<220> 

<221> prime r_bind 
<222> 21243 . .21262 

<223> downstream amplification primer 9-23 , complement 
<220> 

<221> primer Jbind 
<222> 20583 . .20604 

<22 3> upstream amplification primer 9-11 
<220> 

<221> prime r_bind 
<222> 21015. .21034 

<223> downstream amplification primer 9-11 , complement 
<220> 

<221> primerjbind 
<222> 20584. .20601 

<223> upstream amplification primer 99-15285 , complement 
<220> 

<221> primer_bind 
<222> 20139. .20158 

<223> downstream amplification primer 99-15285 
<220> 

<221> primerjbind 
<222> 20642. .20659 

<223> upstream amplification primer 99-15287 , complement 
<220> 

<221> primer_bind 
<222> 20207. .20227 

<223> downstream amplification primer 99-15287 
<220> 

<221> primer_bind 
<222> 20691. .20709 
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8 

<223> upstream amplification primer 99-15286 , complement 
<220> 

<221> primer_bind 
<222> 20238. .20257 

<223> downstream amplification primer 99-15286 
<220> 

<221> primer_bind 
<222> 20943. .20960 

<223> upstream amplification primer 9-2 
<220> 

<221> prime r_bind 
<222> 21295. .21312 

<22 3> downstream amplification primer 9-2 , complement 
<220> 

c221> primer_bind 
<222> 21013 . .21031 

<223> upstream amplification primer 99-15284 , complement 
<220> 

<221> primer_bind 
<222> 20582 . .20602 

<223> downstream amplification primer 99-15284 
<220> 

<221> primer_bind 
<222> 21019. .21038 

<223> upstream amplification primer 99-14407 , complement 
<220> 

<221> primer_bind 
<222> 20571. .20589 

<223> downstream amplification primer 99-14407 
<220> 

<221> prime r_bind 
<222> 21079. .21097 

<223> upstream amplification primer 99-15283 , complement 
<220> 

<221> primer_bind 
<222> 20638. .20655 

<223> downstream amplification primer 99-15283 
<220> 

<221> primer_bind 
<222> 21013. .21032 

<223> upstream amplification primer LSRi9fl5s 
<220> 

<221> primer_bind 
<222> 21195. .21214 

<223> downstream amplification primer LSRil0rl4s , complement 
<220> 

<221> primer_bind 
<222> 20354 . .20372 

<223> upstream amplification primer LSRx9fl3s 



<220> 
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<221> primerjbind 
<222> 20570. .20591 

<223> upstream amplification primer LSRx9fl4s 



<220> 

<22l> primer_bind 
<222> 20811. .20832 

<223> downstream amplification primer LSRx9rl3s 



complement 



<220> 

<221> allele 
<222> 818 
<223> 17-2-297 



polymorphic base G or C 



<220> 

<221> allele 
<222> 1243 
<223> 9-19-148 



polymorphic base C or T 



<220> 

<22l> allele 
<222> 1374 
<223> 9-19-256 



polymorphic base A or G 



<220> 

<221> allele 
<222> 1401 
<223> 9-19-307 



polymorphic base A or T 



<220> 

<221> allele 
<222> 1535 
<223> 9-19-442 



polymorphic base deletion of C 



<220> 

<22l> allele 
<222> 1788 
<223> 9-20-187 



polymorphic base A or C 



<220> 

<221> allele 
<222> 2391 
<223> 9-1-308 



polymorphic base G or C 



<220> 

<221> allele 
<222> 3778 
<223> 9-3-324 



polymorphic base C or 



<220> 

<221> allele 

<222> 4498 

<223> 99-14419-424 



: polymorphic base T or G 



<220> 

<221> allele 
<222> 15007 
<223> 9-24-260 



polymorphic base A or G 



<220> 

<221> allele 
<222> 15233 
<223> 9-24-486 



polymorphic base A or G 
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<220> 

<221> allele 
<222> 15826 
<223> 9-6-187 



polymorphic base C or T 



<220> 

<221> allele 
<222> 19567 
<223> 9-7-148 



polymorphic base A or G 



<220> 

<221> allele 
<222> 19744 
<223> 9-7-325 : 

<220> 

<221> allele 
<222> 19786 
<223> 9-7-367 : 

<220> 

<221> allele 
<222> 20158 
<223> 9-9-246 : 

<220> 

<221> allele 
<222> 20595 
<223> LSRX9-BM 

<220> 

<221> allele 
<222> 21108 
<223> LSRX10-BM 



polymorphic base A or G 



polymorphic base A or C 



polymorphic base G or C 



(17-1-240) 



polymorphic base deletion of AGG 



polymorphic base T or 



<220> 

<221> allele 

<222> 606 

<223> potential polymorphic base C or T 
<220> 

<221> allele 

<222> 5141 

<223> potential polymorphic base insertion of G 
<220> 

<22l> allele 

<222> 7428 

<223> potential polymorphic base insertion of C 



<220> 

<22l> allele 

<222> 8394 

<223> potential polymorphic base C 



<220> 

<221> allele 

<222> 8704 

<223> potential polymorphic base T 



or 



or 



<220> 

<221> allele 
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<222> 9028 

<223> potential polymorphic base G or A 
<220> 

<221> allele 
<222> 9950 

<223> potential polymorphic base deletion of GAATGAAA 
<220> 

<221> allele 
<222> 9977 

<223> potential polymorphic base T or C 
<220> 

<221> allele 
<222> 10021 

<223> potential polymorphic base A or G 
<220> 

<22l> allele 
<222> 11878 

<223> potential polymorphic base C or T 
<220> 

<221> allele 
<222> 19040 

<223> potential polymorphic base deletion of G 
<220> 

<221> allele 
<222> 21363 

<223> potential polymorphic base A or G 
<220> 

<221> allele 
<222> 21449 

<223> potential polymorphic base C or T 



<220> 

<221> allele 
<222> 21451 

<223> potential polymorphic base G 



or 



<220> 

<221> allele 

<222> 21454 

<223> potential polymorphic base A or G 
<220> 

<221> allele 

<222> 21455 

<223> potential polymorphic base G or A 
<220> 

<221> allele 

<222> 21569 

<223> potential polymorphic base T or A 



<220> 

<221> allele 

<222> 21683 

<223> potential polymorphic base deletion of C 
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<220> 

<221> allele 
<222> 21694 

<223>~ potential polymorphic base insertion of T 
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<220> 

<22l> allele 
<222> 21728 

<223> potential polymorphic base deletion of G 



<220> 

<221> misc_binding 
<222> 799.7817 
<223> 17-2-297. misl 



<220> 

<22l> misc_binding 
<222> 819. .837 

<223> complement 17-2 -297 .mis2 



<220> 

<221> misc_binding 
<222> 1224. .1242 
<223> 9-19-148. misl 



<220> 

<221> misc_binding 

<222> 1244. .1262 

<223> complement 9-19-148 .mis2 



<220> 

<221> misc_binding 
<222> 1330. .1373 
<223> 9-19-256. misl 



<220> 

<221> misc — binding 

<222> 1375. .1393 

<223> complement 9-19-256 . mis2 



<220> 

<221> misc_binding 
<222> 1382. .1400 
<223> 9-19-307. misl 



<220> 

<221> misc_binding 

<222> 1402. .1420 

<223> complement 9-19-307 .mis2 



<220> 

<221> misc_binding 
<222> 1516. .1534 
<223> 9-19-442. misl 



<220> 

<221> misc_binding 
<222> 1769. .1787 
<223> 9-20-187. misl 



<220> 

<221> misc_binding 
<222> 1789. .1807 
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<223> complement 9-20-187 .mis2 
<220> 

<221> misc_binding 
<222> 2372. .2390 
<223> 9-1-308. misl 

<220> 

<221> misc_binding 

<222> 2392. .2410 

<223> complement 9-1-308. mis2 

<220> 

<221> miscjbinding 
<222> 3759. .3777 
<223> 9-3-324. misl 

<220> 

<221> miscjbinding 

<222> 3779. .3797 

<223> complement 9-3-324. mis2 

<220> 

<221> miscjbinding 

<222> 4979. .4997 

<223> 99-14419-424. mis2 

<220> 

<221> miscjbinding 

<222> 4999. .5017 

<223> complement 99-14419-424 .misl 
<220> 

<221> misc_binding 

<222> 14988. .15006 

<223> 9-24-260. misl 

<220> 

<221> misc_binding 
<222> 15008. .15026 
<223> complement 9-24-260 .mis2 

<220> 

<221> miscjbinding 
<222> 15214. .15232 
<223> 9-24-486 .misl 

<220> 

<221> miscjbinding 
<222> 15234. .15252 
<223> complement 9-24-486 .mis2 

<220> 

<221> misc_binding 
<222> 15807. .15825 
<223> 9-6-187. misl 

<220> 

<221> misc_binding 
<222> 15827. .15845 
<223> complement 9-6-187. mis2 



<220> 
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<221> misc_binding 
<222> 19548. .19566 
<223> 9-7-148. misl 

<220> 

<221> misc_binding 
<222> 19568. .19586 
<223> complement 9-7-14 8.mis2 

<220> 

<221> miscjbinding 
<222> 19725. .19743 
<223> 9-7-325. misl 

<220> 

<221> miscjbinding 
<222> 19745. .19763 
<223> complement 9-7-325.mis2 

<220> 

<221> misc_binding 
<222> 19767. .19785 
<223> 9-7-367. misl 

<220> 

<221> misc_binding 
<222> 19787. .19805 
<223> complement 9-7-367. mis2 

<220> 

<221> misc_binding 
<222> 20139. .20157 
<223> 9-9-246. misl 

<220> 

<221> miscjbinding 
<222> 20159. .20177 
<223> complement 9-9-246. mis2 

<220> 

<221> misc_binding 
<222> 20576. .20594 
<223> LSRX9-BM. misl (17-1-240) 

<220> 

<221> miscjbinding 
<222> 20596. .20614 

<223> complement LSRX9-BM .mis2 (17-1-240) 
<220> 

<221> misc_binding 
<222> 21089. .21107 
<223> LSRXlO-BM.misl 

<220> 

<221> miscjbinding 
<222> 21109. .21127 
<223> complement LSRX10-BM.mis2 

<220> 

<221> misc_binding 
<222> 587. .605 

<223> potentialsite606 .misl potential 
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<220> 

<221> misc_binding 
<222> 607. .625 

<223> complement potentialsite606 .mis2 potential 
<220> 

<22l> misc_binding 
<222> 5122. .5140 

<223> potentialsite5141.misl potential 
<220> 

<221> misc_binding 
<222> 5142. .5160 

<223> complement potentialsite5141 .mis2 potential 
<220> 

<221> misc_binding 
<222> 7409. .7427 

<223> potentialsite7428 .misl potential 
<220> 

<221> misc_binding 
<222> 7429. .7447 

<223> complement potentialsite7428 .mis2 potential 
<220> 

<221> misc_binding 
<222> 8375. .8393 

<223> potentialsite8394 .misl potential 
<220> 

<221> misc_binding 
<222> 8395. .8413 

<223> complement potentialsite8394 .mis2 potential 
<220> 

<221> misc_binding 
<222> 8685. .8703 

<223> potentialsite8704 .misl potential 
<220> 

<221> miscjbinding 
<222> 8705. .8723 

<223> complement potentialsite8704 .mis2 potential 
<220> 

<221> miscjbinding 
<222> 9009. .9027 

<223> potentialsite9028 .misl potential 
<220> 

<221> miscjbinding 
<222> 9029. .9047 

<223> complement potentialsite9028 .mis2 potential 
<220> 

<221> miscjbinding 
<222> 9931. .9949 

<223> potentialsite9950 .misl potential 
<220> 

<221> miscjbinding 
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<222> 9951. .9969 

<223> complement potentialsite9950 .mis2 potential 
<220> 

<221> misc_binding 
<222> 9958. .9976 

<223> potentialsite9977.misl potential 
<220> 

<221> misc_binding 
<222> 9978. .9996 

<223> complement potentialsite9977 .mis2 potential 
<220> 

<221> miscjbinding 
<222> 10002. .10020 

<223> potentialsitel0021.misl potential 
<220> 

<221> miscjbinding 
<222> 10022. .10040 

<223> complement potentialsitel0021 .mis2 potential 
<220> 

<221> misc_binding 
<222> 11859. .11877 

<223> potentialsitell878 .misl potential 
<220> 

<221> miscjbinding 
<222> 11879. .11897 

<223> complement potentialsitell878 .mis2 potential 
<220> 

<221> miscjbinding 
<222> 19021. .19039 

<223> potentialsitel9040.misl potential 
<220> 

<221> miscjbinding 
<222> 19041. .19059 

<223> complement potentialsitel9040 .mis2 potential 
<220> 

<221> misc_binding 
<222> 21344. .21362 

<223> potentialsite21363 .misl potential 
<220> 

<221> misc_binding 
<222> 21364. .21382 

<223> complement potentialsite21363 .mis2 potential 
<220> 

<221> misc_binding 
<222> 21430. .21448 

<223> potentialsite21449.misl potential 
<220> 

<221> misc_binding 
<222> 21450. .21468 

<223> complement potentialsite21449 .mis2 potential 
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<220> 

<221> misc_binding 
<222> 21432. .21450 

<223> potentialsite21451 .misl potential 
<220> 

<221> misc_binding 
<222> 21452. .21470 

<223> complement potentialsite21451 .mis2 potential 
<220> 

<221> misc_binding 
<222> 21435. .21453 

<223> potentialsite21454 .misl potential 
<220> 

<221> misc_binding 
<222> 21455. .21473 

<223> complement potentialsite21454 .mis2 potential 
<220> 

<221> misc_binding 
<222> 21436. .21454 

<223> potentialsite21455 .misl potential 
<220> 

<221> misc_binding 
<222> 21456. .21474 

<223> complement potentialsite21455 .mis2 potential 
<220> 

<221> misc_binding 
<222> 21550. .21568 

<223> potentialsite21569.misl potential 
<220> 

<221> misc_binding 
<222> 21570. .21588 

<223> complement potentialsite21569 ,mis2 potential 
<220> 

<221> misc_binding 
<222> 21664. .21682 

<223> potentialsite21683 .misl potential 
<220> 

<221> misc_binding 
<222> 21684. .21702 

<223> complement potentialsite21683 . mis2 potential 
<220> 

<221> misc_binding 
<222> 21675. .21693 

<223> potentialsite21694 .misl potential 
<220> 

<221> misc_ binding 
<222> 21695. .21713 

<223> complement potentialsite21694 . mis2 potential 
<220> 

<221> misc_binding 
<222> 21709. .21727 
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<223> potentialsite21728 .misl potential 
<220> 

<221> misc_binding 
<222> 21729. .21747 

<223> complement potentialsite21728 .mis2 potential 
<220> 

<221> misc_feature 

<222> 22113,22122,22227,22264,22268 
<223> n=a, g, c or t 

<400> 1 

ccataatcaa gaaaatggat aataagtttt ggtggggatg tggagaaatt ggaatcctcc 60 

gtgcattgct ggtgggaatg tacaatagtg cagtcattgg ggaaaacagt ttggcagttc 120 

ctcaaaaggt taaaaataga actaccaagt cacccagcaa ttccattctt aggcatatat 180 

tcaaaagaaa tgaaagcaga tatttgtaca ccagtgttca cagctgcact atttacaata 24 0 

gtcaaaaggt agaaacaacc taggtccatc cacaaatgaa tggataaata aaacgtagca 300 

tatacataca atggtacact agtccgctgt aaaaagaaat tttgatctta ctgcatgcta 360 

catggcttcg acatactaca acatggatgg accttgaaaa cattattctt tgtgaaataa 420 

actagacaca ggacaaatgt tagacgattc cacttatatg aggcacctag aatgggcaat 480 

ttggtaagca aagtagaata gaaattacta ggggcacagg tagcagggaa tggggagtta 540 

ctgtttaatg gtcacagagt ttatgttggg gatgatgaaa cagtttcggg gataaagagt 600 

ggtgactggt acacgacatt gtgaatatac ttaatgccac tgaattttac acttgaagtg 660 

gttaaagcga taaatattat agtttgcata ttttatcata aaaatatttt tttaaacgat 720 

gaagggacgt gaacgggttg aaattttata aaaagtggcc agggaaggtg tcactgcaat 780 

ggtgtcctac aggaggagga agatcatgtg gacatctscg ggaagggtgt tctggcagag 840 

ggagtagcac gggcgatggc tctgaggact gtgagaagta tagttggaaa cagcgaggag 900 

gccagggtgt ccgaagctga gtaagccaga gagagtggga ggaggtgaga taagaggggg 960 

aaggtcagtt tctgctgaga gtgaggagga gccacaggag ggctgtgagc aggtggacgt 1020 

gatctggctt gagttttaac agggccagta gaacaaagca cgcctgggta ccgaaaccag 1080 

ccactggcca gttggcaacc tgggggagtc taacgcgagg aagcgcccag ggttccccca 1140 

ggatgcgctt tccctcgccg ccacctggag acagcagagt cacgcccagc gctgcgcagg 1200 

ctgatcgccg cgccgcgccc ccgccctcgg tcgcaggtgg ctygttccgg gaattcctaa 1260 

gcggaaaccg gtcccaagcc ccgcgccttc gctcggcccc tttaagagcc agaatttccg 1320 

gagggctgac ccggggctag ggatgcccag gggccgaacc acaagttggg aacrggtggg 1380 

ggaggtggcg aaaacttccg wagtggaatt ccaacttttc ctggccctga ttccccttgg 1440 

gcatccctga gggggcagag cttcccttcc ggggacttta gagggttcct caggtcatct 1500 

aactgggaga cacaggaggc ccgaagcgcc ccccctccac ccggtccgga ggaaccccag 1560 

tggaagtgga gaagtcaggc gccaccaaca agcctctccc agccaggact ttgcttagac 1620 

tcgctcctcc cggcagggcg cacctaggcg ggtccatcgc cagccgggga gaggggtttg 1680 

ggcagggagg gaacaggtgc gcggcgggac ccgccctatc tcaacaggtg aatcgctcca 1740 

agtgggtctc ggttgcatgg atctcggtgc gcttggtttg gccggagmag atgggggccg 1800 

gaagggacct gtggtccgca ggcgccctcc cagcgggcca gtcacttggt tcgggccctg 1860 

ggggacggag cgcacctggg tcagcccact tccggggagg gaggcagagg aacccctccc 1920 

cgccgctcac ccctaagccc agccctcggc tcccaccctt gtgtacctgg gccgaaccat 1980 

tcaccggagc gcgcagcggg tggagtgtgg ctcggaggac cgcggcgggt caagcacctt 2040 

tctcccccat atctgaaagc atgccctttg tccacgtcgt ttacgctcat taaaacttcc 2100 

agaatgcaac aggacggact tggagtaggg acaaggaacg gaagtgggaa ggggaggagc 2160 

gtgcacccct cctggccttg gtgcgcgccg cgccccctaa ggtactttgg aagggacgcg 2220 

cgggccagac gcgcccagac ggccgcgatg gcgctgttgg ccggcgggct ctccagaggg 2280 

ctgggctccc acccggccgc cgcaggccgg gacgcggtcg tcttcgtgtg gcttctgctt 2340 

agcacctggt gcacaggtac ggggcacggg gcctctgacg ctgcggaacg scggagggaa 2400 

ctgtagaggg ggatggatgg agttggaggc ggcgggaagc gggaagcggg ggtctcagag 2460 

gctgggacct tccgatcccc tgggtcttgg gcgatctgtt gcgcgcggga gtgagaggaa 2520 

ttccccattt gtgccgggga gcgctccccg cgcccttatc tggaagatag caggaagtga 2580 

aactccctgg acggtgagac ccggagcggc agggagaatg gaactctttg tggggaggga 2640 

gtggaagacc gcccgatctc tgggaaaaga aaagccggga tgggacttgg gcgcacccgg 2700 

ggatttctaa gttttggagt aacggggaga gggcacggga gggctggatc agacgcttcc 2760 

tagagggaca gagacgaagg aacaatgcct aggcctcggg tgggtgtggg actggggact 2820 

ccccatcccc cgcaccccac ccacctcccg cgggctccgg attatacgtg cgtaagagtc 2880 

tggtgggatg gatttacgga cttgaaaccg acttctgctg gcaggctttc acctggatgg 2940 

gatatttggg tggtgatgag gtctttcccg agacactttt ggttcagtca tttgaaatga 3000 
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ctttagagta gggtgaggtg gtgggaggct gatggagata ttgtgggggc tttagtccct 3060 

ccatggcaaa gcagttcagg caaacaactc catggttttc cctccaaatt caaaaggccc 312 0 

cgggtaacct ggaatccttc gtagtcggtt ttgaagtggg gccttgggcg ctgggggcat 3180 

caacatggcc atctgggctt gcctgcccag gccacacaga ggccccttgt tgtgggtgaa 3240 

tggcaaaggg aagaggggac tggtgtggtt cagaggccac aggctgggaa gagggatggc 33 00 

gggcgagtcc aaggaaactg gccgtgtcac cgtgcacctg ccacttcagc cccacgggtc 3360 

tataaaatgg gcatgattat cgtggctacc tcactggtcc tggcaattaa ggaacaatgt 3420 

gtgccaggca ctctgtaaac cacatacttg cgagtgtcaa gctggtgaca ggtggcgttc 34 80 

ctgttgaagc acctccctga gctcacagca acccttgctg tctctcctct tgccctcagc 3540 

tcctgccagg gccatccagg tgaccgtgtc caacccctac cacgtggtga tcctcttcca 3600 

gcctgtgacc ctgccctgta cctaccagat gacctcgacc cccacgcaac ccatcgtcat 3660 

ctggaagtac aagtctttct gccgggaccg catcgccgat gccttctccc cggccagcgt 372 0 

cgacaaccag ctcaatgccc agctggcagc cgggaaccca ggctacaacc cctacgtyga 3780 

gtgccaggac agcgtgcgca ccgtcagggt cgtggccacc aagcagggca acgctgtgac 384 0 

cctgggagat tactaccagg gccggaggat taccatcacc ggaagtatgt tgggcagggc 3900 

agggggatga ggctgggctt gcccgggtgg tgggactggc gtccttgtgc gggacctgga 3960 

gtccccatct gaaagctctt gagtgccagt gtctgaaagg accattgaag ggagcaattc 4020 

tttttttttt tttttttgaa gatggagtct tgctctggac tccaggctgg agtgcagtgg 4080 

tgcgatctca gctcactgca acctccacct cccaggttca agcaattctc ttgcctcagc 4140 

ctcccgagta gctgggactc caggtgcgtg ccaccacgcc cagttaattt ttgtattttt 42 00 

agtagagatg gggtttcacc atgttggcca ggctggtctc aaactcctga cctcaaatga 4260 

tctgcccgcc ttggcctcgc aaagtgctga gagacaccat acccagccta aagggagcga 4320 

ttctattcta ctattcttcc ttctgctaat ccttccattc tttaatttaa taacgaagat 4380 

tttttgagta' cctgtcatat accaggtgct gttctgggcc ctgggaatac agctgttaac 4440 

aaaatcatca aaccacttcc ctcgtggagc ccacattgca gtgagagaga caaacackac 4500 

acacactctc aagtccttga agataaagaa aactgggtaa cggagagaag aggccagggt 4560 

ttgttctata atcattaata acacgagcag taagaagtaa aatttatcta agtaacaact 4620 

tataaagggt ctactgtgtg ctaagctctc atccaggttc ccaaggatta actcagacca 4 680 

cacagtaatt gaatagattc tatcattgtc atcttacaga ggcccagaga gagaaagtga 4740 

cttgcctagt gtcatagctg gtaacggggc tgggattcta actcagccac tttgggtcta 4800 

gtggccaagc tcctaatccc tttgcttgcc tagggtggtc cgcagaggac tcacagagga 4 860 

gatggcagga gtgaactgca ggggcaagag agcttaatgg agaaagcctg tgacatgcca 4 920 

ggaactgcac acatattctc ccattgagtc ctctcctcta ccctcctgac agctgaggca 4 980 

cagagaggtt accttgttca aatgggtgca taggaagtca aagtctggag ctggggtttg 504 0 

aacccaggca gccctgagaa ccttgttctt tttttttgag acggagtctc gctctgtcgc 5100 

ccaggctgga gtgcagtggc gggatctcgg ctcactgcaa gctccgcctc ccgggttcac 5160 

gccattctcc tgcctcagcc tcccaagtag ctgggactac aggcgcccgc cactacgcct 5220 

ggctaatttt ttgtattttt agtagagacg gggtttcacc gttttagccg ggatggtctc 5280 

gatctcctga cctcgtgatc cgcccgcctc ggcctcccaa agtgctggga ttacaggcgt 5340 

gagccaccgc gcccggcccc ttgttcttaa ctgtaatgct gcctcctgat aggatgtgcc 5400 

tgttgggact aagtaagggg cagtcattca ttcattcatt tggtatttat caagcatcga 5460 

ctatgtgtcg ttggtgctgg ggatagaggt gattgggatg gctgaagttt ctgtcgtcaa 5520 

ggagatgaca ttctggtgga gtgagactgg cagtaaataa gcagataaag aaagagtatg 5580 

agaatttcaa agtctgggca cggtggctca cgtctgtaat ctcagcactt tgggaggcca 5640 

aggtgggtgg atcacctgag gtcaggagtt ccagaccagc ctggccaaca tggtgaaacc 5700 

ccgtctctac taaaaataca aagattagcc aggcatggtg gcacatgcct gtaatcccag 5760 

ctactcagga ggctgaggca tgagaatcgc ttgaacccag gaggcagagg ttgcagtgag 5820 

ctgagatcgc accactgtac tgcagtctgg gcgacagagt gagactctgt ctcaaaaaaa 5880 

aaaaaaaaaa aaaagactcc gtcaaggtat aagaatgtca gagagtacta agtgttgcaa 5940 

agaaaataac accaggctgg gtgcattggc tcatgcctgt aaatttcagc actttgggag 6000 

gccaaggcag gaggatcact tgagcctagg agtttgagac cagcctggac aacaaaatga 6060 

gaccccatgt ctacaaaaat tttaaaaatt taaaaattag ctgggcatgg tggcatgtgc 6120 

ctgtggtccc ggctgctcag gaggctgagg tgggaggatt gcttgggctt gagaggtcaa 6180 

ggcttcagtg agtcatgatc gtgccactgc attccagcct gggtgacaga gtgagaccct 6240 

gtcttgaaat gaaaagaaaa taggctgggc gcagtggctc acacctgtaa tcccagcact 6300 

ttgggaggcc gaggtgggtg gatcacctga ggtcaggaga tcgagaccag cctggccaac 6360 

atggtgaaat cccatctcta ctaaaaatac aaaatttagc cgggcgtggt ggtgggcgcc 6420 

tgtaatccca gctactcggg aggctgaggc aggagaatcg cttgaacctg ggaggcgaag 64 80 

gttgcggtgc gccaagattg cgccactgca ctctagcctg ggaaacagtg agactccgtc 6540 

ttaaaaaaaa aagaaaaaag aaaatagcac tgggtgatgt gctacatgga atgacttggg 6600 

ctgtgaatat gatttgagga gggcctgggc ctgggcctta cagaacctag aaggcagaga 6660 

ggaaggggag gggcagggtg ccagggatga aggctcacgt acctcatgtc ttagtgtgtg 6720 

ttcactgtct taaacaagaa tttaaagttg ggcatggggc agagcgggga agggagcatc 6780 
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cctttgcaga ccccaagaag ccaggaactg gagcacattc tgctagagga tcgatgggaa 6840 

gcagggttcc aggggctgag cctatgtcag tcctgtttca gaggaggcac caggcttgct 6900 

tgccctgaat ttctgtgggc agctcagcca tgagcatcct actgttattg aggtcacagg 6960 

gctgcttagg ccccctcctc tctaacccag ggattgtgcc tgcctggacc aggcgtgact 7020 

gctaagcttc tgccaggaca agccaaatac tgagggtgct tcctctgctg gacgcaaaag 7080 

tccaggatga ccccccaggc tctgtctcgg ggaaggggcc ctgcatgctc caggggcctc 7140 

acaggcctgg gtctttcaaa ccacccccac ctgggcctgt gtttgatcaa ggccctgagt 7200 

gtaaacatcc attgtgtgtg tcctttcagg aaatcccata gccataggag cttcctctgt 7260 

ttcagctttg aggatgggga aaagtggact ccccgtggtg ttcctagggt cacccactgt 7320 

gctggggttt ttctgttgtt gttgtttttt ttctgttgcc caggctggag tgcagtggtg 7380 

caatctcagc tcactgcaac ctctgcctcg caagttcaag tgattctccc gcctcagcct 7440 

cctgagtagc tgggattaca ggtgcacacc accacacctg gctaattttt gtatcttttt 7500 

ggtagagatg ggatttcgcc atgttggcca ggctggtctc aaactcctga cctcaggtga 7560 

tctgcctgcc ttggcctccc aaagttctgg gattacagat gtgagccacc atgcccggcc 7620 

tatcctggtt tcaaaagtga aaatagtcct ggataaggta gaaggctgtc cactccaggc 7680 

atccctccgg tccggtggct cattccctgc tttgtccttc catgctttgg gtgatggacc 7740 

agcacctgga caggaggccc tgttccacct cctcgggctc cttggggtcc aagtgccccc 7800 

acctccagct gcactgcagc agagagccca tgggacctct gaaatcatga aggtcacctt 7 860 

tgcggtgtat aaagaaggaa ccagaggttg gagatgtgga ggaggcctgg ctgctgttcc 7 920 

cactggagac ctggcatctt ctccccgacc taaaacaatg aaagcagtgc tcagcccgga 7 980 

tgagatcacg gccagcccaa gaccaggaac agggtacgcc ctgcaggaag aaggtgtgcc 8040 

cagaccttag gatggatcaa aagaagccgg aaaactatat tttttgtgag ttttgaaaat 8100 

gtcagacagg tcaaacaaaa cacagtgagg tccagcctcg gcctacaaga tgccagattt 8160 

caacccctgg cctatatgat ctgtttgcca tggcaggcgg ttcctgtcca cctcttttgt 8220 

ttatagcagg gaccagctct tgagctccag tgttgaagag gcacggtcag ggtctgatct 8280 

gaagacactg gtggctcatg cctgtaatcc cagcacttca ggaggccgag gcaggaggat 8340 

tgcttgagga caggagctgg gagaccagcc tgggcaacac agtgagaccc agacactaca 8400 

aaaaaataaa tttagcgggg catgatggca caccctgcta ctctggagat gggaagattg 8460 

cttgagccta ggagttcgaa gctgcagtga cccatgatcg caccactgca ctccagcctg 8520 

ggcgaccaag ctaggccctc tcaaaaaaga tacaggtgga aaaatgatgg acgaagaggg 8580 

cattgtggca aacctgggga tttaggagaa cctagtttgg aattctatga ggattcaatg 8640 

aaagaatgtg tgtagagggg cccagcacat agtaagagct caataaacgg tgggggctag 8700 

gggtggtggc tcatgcctgt aatcccagca ctttgggagg ctgaggcagg tggatcactt 8760 

gagccctgga gttcaagatc aacctggaca acaaagcaag atcccatctc aaaattaaaa 8820 

aacaacacca acaacaaaaa aacagtggct tagatgcctg atcattaggg taagtcgtgt 8880 

cctcaacccc ttcacatctg ctctgaaggt caccatatcc ggaagccttc cctggcctcc 8940 

ttgtttaaaa tggcacagcc cccactccac gcctggcact ctctgctgtc cctgattcgt 9000 

tttctccata cagcttatct ttgtctggta tgtgacatag ttaacatttt atatttgtct 9060 

ttctttccta gttagaatct gaactctaga agggcaaggg caaggattta taactcaaag 9120 

attccgggct taggcctctt ttatattctt gattttgagg ttaattaaga gctcaggcct 9180 

agcgaggtgg ctcatgcctg gaatcccagc actttgggag gcccaggcgg gcagatcact 9240 

tgaggtcagg agttccagac ctgcctggcc aacacagtga aaaacctgtc tctactaaaa 93 00 

atacaaaaat tagccagtta tgttggcagg cgcctataat cccagctact caagaggctg 93 60 

aggcaggaga atcgcttgaa cccaggaggc agaggctgca gtgagccaag atcgtgccac 9420 

tgcactccag cctgggcaac agagcgagac tccatctcaa aaaaaaaaaa aaaattaaga 94 80 

gctcaaagag tttgttttca taggcagcag aatgagaaaa gtttacaaaa tagtttaaat 9540 

gacaataaag tcattataga ttaacataaa taaaatacct tttatgaaaa aaataatcat 9600 

tttctgaaat cagacaaaac attgtgaatg agaaggtggc atggttttat ttttttgcaa 9660 

gtctccgaag cctggctgga tagaagagcc tggcttctca gagctgcttc agtctgttgt 9720 

gatatctatt gtatgtcacg tagcctctgg aaaactccac agttagtatt gttgggaaaa 9780 

taactttgac ctcaggatct cctgaaaacg tcttggggaa ccccagggtc tagaggctgc 9840 

agtttgagaa ctgttgctgt ggtatcccag gtgtctcaaa tactgcctag aacataggtg 9900 

gtactcagta attattgttg aaggatgaat gaatgaatga atgaatgaat gaaagaaaga 9960 

aatgtgtctt tgaatctagc catgtgccca gaatgatgag acagatgaca aaagctaagg 10020 

gactttagca tgaggagagg gggttcgttt cctttttttt cttttttttt tgagatggag 10080 

tctcactcta ctgcccaggc tagagtgcag tggtgcaatc tcagctcact gcaatctctg 10140 

cctcctgagt tcaagcaatt ctcctgcctc agcctccagg gtagctggga ctacaggtgc 10200 

gtgccaccat gcctagctaa ttttttacat ttttggtaga gatggggttt taccatgttg 10260 

gccgggctgg tctggaactc ctgacctcaa gtgatccacc tgcctcagcc tcccaaagtg 10320 

ttaggattac aggtgtgagc caccatgtcc ggccaagagg gtgttcattt ctgctccttg 10380 

ccaggtattg tgtcaggcac tggggaccca gcagtggctg agacagacag ggctctgcct 10440 

cacggagccc acattttcac caggcaaagg atggtcggcc cctaagctgg gagataagac 10500 

ttcagcagtt gggtggggga gccgtgggag aagcccagcc cacaggggga cagtgcaaat 10560 
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ctagaaccaa ggcgatggca ggggtgaggc tggcacggta gctagagacc acgtcgtgcc 10620 

aagggccttg gggaccatgg gactatggga ccttagggaa ggcgtctgga atgctgtagc 10680 

cagacactgt tgcaaggagg atttttctgt agacatgagg ccttccttat gaagaaagca 1074 0 

agggttcttt cattcctggg ggtgccaggt gctgtggact gcagcacgcg tggttgctgc 10800 

cgtcacagag ctgtcatgca ggagggcagc gcgtccttgg gaaggtggca ggcaggtcag 10860 

gctaggagga aagaggccgg gaagctgagg gcatttcctg cccgagatgc ccaatgtagc 10920 

ctacttctgt ccccagtggc ttaaggcaga gttgcctggt aggtgccctg gtcccaccct 10980 

ggtgaaaggc tgaaggtatt taattagtgc ctgagaagca gagaggaaac aggatgtgcc 1104 0 

aaaacacttt gatggatggt agagttaaca ggctccttgc ctgcagctgc ttcagacaag 11100 

agcgtcccca agccctgggc ctgacctgga atgtggggat ggaaggggag ggggaggaac 11160 

caaggcactg ggagggtaag tctctctctc ccacatagac acacccactc cttatgggtg 11220 

cctgggcatc tcctggtacc tagaatctgg cctgtttatc tccacaccca tccctggggt 11280 

ctacactagg ccctgtgggt ggcagttcac atcaggggag ttctgacttt ggctctgaga 11340 

ggtggttcag agatggctgt aagttgagaa gcacagactg ctgggtgtgg tggttcacgc 114 00 

ctgtaatccc agcactttgg gaggctgagg tgggggtgga tcacctgagg tctggagttc 11460 

aaaaccaact tggtcaacat ggcgaaactc catctctact aaaaatgcaa aaattagcca 11520 

ggtgtggtgg caggtgccta taatcccagc tacatgggag gctgaggcag gagaatcgct 11580 

tgaatctggg aggcgaagat tgtagtgagc cgagattagt tcgcaccatt gcatgccagc 1164 0 

ctgggcaaca agagtgaaac tccgattcaa acaaacaaaa aaaaaaagct gggcatggtg 117 00 

gagtgcctgt agtcctaact actcaggtgg gaggattgct tgagtccagg aggttgaagt 11760 

tgcagtgggc tataattaca ccactgcact ccagccaggg ccacagagtg agaccctgtc 11820 

tctaaagaaa gaaaaaaaaa aacaacctca ggctccgagg gcaccattac tgctctacac 11880 

tgaagagctg tgcagctttt ccagacccga aatgtcatcc acaaaacaga agtgataatg 1194 0 

gtcctgcctc acagacttct tgcagtagtc caggtgttta gaacggggtg taaaaggccg 12000 

tgtgcccttg gtaggaatct ttgcatatgc atttgatcat ctgcagcctg cccagcccac 12060 

tgcttgcccc ctcctgggtg tgctgggaag gggtctttgg ccctccaggg gttaggtgcc 12120 

ccagcctcca aggtgccctc acgccttttc atcccgactc agatgctgac ctgacctttg 12180 

accagacggc gtggggggac agtggtgtgt attactgctc cgtggtctca gcccaggacc 12240 

tccaggggaa caatgaggcc tacgcagagc tcatcgtcct tggtgagtgg gcctgggaag 123 00 

ggggaggcat ggcccttcct tttgtccgct tctgttctgt ctgccctccc ctgtgtccgc 12360 

cctctgccct ccagcttacc ctctgggctc tgtcgcctgc tctgctctcc cccaggctct 12420 

gccagtcact taggctcccc tgtgccctgc accccaggca gggaccactg gcccacagtg 124 80 

cctccaatca cccaagccaa actaagagaa gagtggagac aattggagac tctgcctttt 12540 

caaagtctca tttttaaaaa aaatccagac ttggggtccg ggtgcggtag ttcatgcctg 12600 

taatcccagc actttgggag gccgaggcgg gtggatcact tgaggccagg agttcgagac 12660 

tagcctggcc aacgtggcaa aatcccgtct ctataaaaaa tataaaagcc aggcgtggtg 12720 

gtgcacatgc ctgtaatccc agttactcag aaggctgagg catgaggatt gcttgaacct 12780 

gggaggcaga ggatgcagta agccaagatc aagccactgc actccagcct gggcgacaga 12 84 0 

gtgagactct gtccaaaaaa aaaaaaaatc cagacgtggt cagagtccat gggcagtgaa 12900 

tgaggacagt tgatggtgtg caaaatcgac ccacctcttg ctacatcccc aaggcctcat 12960 

ctcacccgag tccctcgcca aagcacagcg gttttgccgt gtgccctgct gggatggcgc 13020 

tgcatggcac acacactgtg taagtttgag tgcagctgaa acgaagccga ttccagacac 13080 

ccaggggcag ggcggggtgt ccgtgtggct gggaggcctc cttgtgttag ggggatgttg 13140 

ccatcggcca ggtgccctgc tgtaagccaa cacatggagt cttgtatgac atgtgctctg 13200 

catgagtgat gccgctgggc tgtacactgc catcttcaca tgtgtgaatg agcacgtgac 13260 

tggggggtac ttgggctgca agacagagtt catgtgtggg ggatggaaca cgtgcaccag 13320 

tgacccagga acctctgcct gttcttcggt aaaatgcacc atttgcatca gcagttccca 13380 

aaattagtct ccaggtctat ttacactcta aaacattatc gagggtctcc aagagctttt 13440 

gtttgtttct gtgggtttta tgtctatctg ttgcttaaca tattaggaat taaaatgggg 13500 

agattttcct tttttttttt ttttttttga gatggagtct cgttctgtcg cccaggctgg 13560 

agtgcagtgg ctcgatctcg gctcactgca agcttcacct cctgggttca cgccattctc 13620 

ctgcctcagc ctcccaagta gctgggacta caggcacccg ccaccacacc cggctaattt 13680 

tttttgtatt tttagtagag actgggtttc accatgttag ccaggatggt ctcgatctcc 13740 

tgacctcgtg atccacccac ctgggcctcc caaagtgctg ggattacagg catgagccac 13800 

tgcccggcct taaaatgggg agatttttca agcccaagat acacaaggaa gactgggcaa 13860 

catggcaaga ccctgactct acaaaaaatt ttaaaattaa ccaggcatgg tggcatgcac 13920 

ctgtgagccc agcttcttgg gaggctgagg caggagtatc gcttgcaccc aggaggtcaa 13980 

ggctgcagtg agccatgact atgctactgc actctagcat gagtgacaga gaccctggct 14 04 0 

caagaaacac aaacacacac acacacacac acacgcatat agtccattag gcatcagggc 14100 

gatgatggca tcagggagcc tgggaaactc tactggacat tcatgggaga acaagtgaaa 14160 

aaggcaaata acatcttagt gttattctaa aatttcttct tttggccttg tggacaggac 14220 

cacgctttga gagctgtgac tgacatgcct ctgtcctgtt gcgagggcct atagtgccaa 14280 

gtgcatgagc tctggggagg gcttcgtggg tgcagagctg ggcctgtgga ggcccctcag 14340 
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acacaacact 
tgagatgcgt 
aggactgcat 
aaaaaaaatg 
tgttcagatt 
agccctagct 
agctgcagtg 
tatttcgtgg 
ggagcccttt 
tagagccctc 
ttatgcaatg 
ccatgcrcta 
ccagcctgga 
ctgtttctct 
tcaggcgggg 
ccatacatca 
cactgtggac 
gctgggccta 
tggcacctgc 
ggggctgcag 
acctggggga 
cccagcctct 
tcgtccccga 
tccctcccag 
tgtggtatgc 
ctgcccgcac 
ccccgaggcc 
aagggaagga 
aatttagcca 
ttcctccaac 
atacacatgg 
taaaaaaacc 
ggccaagacg 
aaacccagtt 
tcccagctac 
agtgagctga 
aaaaaaaaag 
actacatgat 
tccaggcagt 
cctcccgcct 
attttttgat 
aaactcctgg 
catgtgccac 
ttgcccaggc 
atgggattac 
ttttggagac 
ctgcaacctc 
gatataggcg 
tgccatgttg 
cctaccttgg 
ttaagcaaat 
tgtctagggt 
agatgagagc 
acagagagag 
tgctcctgtc 
agctgattga 
agtggcactg 
gaccacagct 
tgtgcacaca 
caggtggaga 
cctccctagg 
tgggtgcatc 
atataaaata 



ggtggggctc 

gaatgtgaca 
atgactttca 
tgtgattatg 
tgtaaagtaa 
acttacccca 
agccatgatc 
gccctaggtc 
ggggtgctaa 
agaaccccct 
ggagtttctt 
gggcttcagc 
gctgggcttt 
tttgtccctc 
cccatagaag 
cctactgctt 
ccctcactaa 
ggctaggctg 
ggtgctcttg 
tctgtctccc 
gggccgtaac 
ggcacaaaaa 
cctcagtgct 
gcctccgggc 
ctggctgcct 
acttgctgct 
cgtaagtgtc 
ggtggccatc 
gtggggagaa 
ttttagttta 
taaaaatgtc 
aaatcttggc 
agtggatcac 
ctctactaaa 
tcaggagact 
gatcacgcca 
aaaaaaaaat 
ttaagcaaaa 
gcagtgcgat 
cagcctctgg 
ttttgttttt 
cctcaggtga 
catgctggcc 
tggtcttgaa 
aggcatgaac 
agtttcactc 
cgccccccgg 
cctgccacca 
gccaggctgg 
cctcccaaag 
gtttgaaaac 
cgtatctagc 
taaaggggaa 
agggacctaa 
tctcttctgt 
cgtggtcaca 
atggacactt 
gttgccagtc 
gtccaagata 
gctgaggcct 
gctcagtgtc 
tggggatttg 
ttctacgtct 



agagctccag 
gtgcatgagt 
tatgtgggta 
ttgtcacaaa 
aattaaacca 
gaggctgagg 
acacccctgc 
cctgtgcctg 
gacctatgaa 
gggatttgta 
acctataagc 
ccccagcccc 
cgggggtggc 
cagggaggac 
gtacgggggg 
ctgactctag 
cctggcctga 
ggctgaggag 
agggtgtggc 
tctgtgcagg 
tggggaatgg 
atccagccag 
ggctccgcac 
tcagggcctg 
tcctcatctt 
gctacgtcag 
ccgctcatgg 
cacctgcccc 
agtaggctga 
tttttattta 
caacagtaca 
taggcgcagt 
ttgaggtcag 
aatacaaaaa 
gaggcatgag 
ctgcactcca 
gttaagtgaa 
aaaatttttt 
catagctctc 
agcagctggg 
tgtagagacg 
tcctccgaag 
aatttttaaa 
ctcttgacct 
taccacacct 
tgtcgcccag 
gtttaagcta 
cgcctgacta 
tctcgaactc 
tgttaggact 
acatacccac 
acaccagcat 
gtaagcaaac 
ctctatgaga 
gaatgtttgt 
gccatcagcc 
cagcatccct 
cttggtttcc 
catgaacagt 
gcttgacacc 
atcccctata 
gcgagctcat 
gtcagctgct 
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gggcactcga 
agagatggag 
tttttgcctt 
gagtttattc 
tttcagccag 
tgggaggatc 
actccagact 
ctggaacagg 
tgagggaaac 
ttggagccct 
acccacatgt 
ttcccgcttc 
acagcctggg 
ctcaggggtg 
tggatcctga 
ttagtatccc 
ctgtggctct 
agcctggggt 
gtctgggcag 
ctgcctcgtt 
ccgggatggt 
ggctgcaggt 
catgtacccc 
ttgtctttct 
cctcctcctg 
gtgcccctgc 
ccaccctggt 
caggacagtg 

ggagggtctg 

tgttgttctt 
agatactagt 
gattacgcct 
gagttccaga 
ttagctgggc 
aatcgcttaa 
gcctgggcga 
aaagttaaga 
ttgttttaga 
tgcagcctca 
actgtaggca 
gggtctcagt 
tcagcctccc 
aattttctgt 
caagtgatcc 
ggccttaaac 
gctggagtaa 
ttctcctgcc 
atttttgtat 
ctgacctcag 
acaggtgtga 
aggaatgctg 
ggctactgtg 
caagcagggg 
ggagtcagac 
gaatatccca 
ttgggacacc 
ctagctgctg 
ttcaggaggg 
gaagtgccag 
ttcctgctca 
aaatggggct 
gtgcacacca 
gggcactgcc 



gggaagacaa 
accttgtggg 
catgggtccc 
ctgtatattg 
gtgtggtgac 
gcctgagccc 
gggcgacaga 
acatccctat 
ttagggtgcc 
cgtggcataa 
gggcgggtgg 
agggcacacc 
ctggctctgg 
gctgagctct 
gttgggcttc 
cttccccact 
gaggcatcta 
gcaggccagg 
ctggctctct 
ttctgccttg 
agaatgggga 
tccttggtga 
tgctgtgccg 
gcagactggc 
ggcatytgct 
tgcccagaca 
ttgggcaaca 
gcgttggtct 
ctgtttagat 
ttcttttgta 
cacatggaag 
gtaatcccag 
tcagcctggc 
atggtggtga 
acccaagaag 
cagagtgaga 
aaccaaacaa 
gaaagggtct 
aactcccggg 
cacaccacca 
atgttgccca 
caaagtgctg 
agagacaggg 
tgcctcaggc 
ttaagcaaat 
agtggcgtga 
tcagcctccc 
ttttagtaga 
gcaatccgct 
gccaccatgc 
cacattttac 
gagagctggg 
aaggtaagag 
atgtgcaatt 
acgggacact 
agaccacagt 
tcccgtttcc 
tgtctgggta 
gcaatccttg 
gaagcccagt 
tatggcagag 
ttgagcatgg 
actatcagcc 



gaaccggctc 

tcccagaacc 

ttcctgtttt 

tgttaatttg 

acatgcctgt 

acgaggttga 

gctgagatcc 

caccgtggtt 

caagctgagg 

cacaggtgga 

agggtaggag 

ttgcacttgg 

ccagcataat 

tacctggttt 

tcrggagctc 

aaaccctgct 

gtggtctggc 

gctctgtgac 

ctttggtctg 

tgttttttgc 

gtgtgctgtg 

gctttgcaaa 

ttagccctgt 

tcttcgtggt 

ggtgccagtg 

agtgctgctg 

tcctgcatcc 

ggagggtgtg 

tgtcgtttac 

agtataatcc 

taaagccctc 

cactttggga 

caacatggta 

tcgcctgtaa 

tggaggttgc, 

ctctgtctca 

ggtttacaac 

cattctgtca 

ttcaagcagt 

tgcccagcta 

gcctgatctc 

ggattacagg 

tcttgctatg 

tcccaaagtg 

tttttttttt 

tctctgctca 

gagtagctgg 

gacggggttt 

cccccgcacc 

ctggccaaat 

ccagctacta 

actggatgtg 

aagacagaag 

gaaaaagact 

ttcacagagg 

gtgtacacta 

cctcctcggg 

gaccagcctg 

caagcatggg 

gagcagtttc 

ctcaccacac 

ggcccaacct 

tcagtagtga 



14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
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ctgagggaca gggcaccagt cagagccctg gtgcacacag agtgacccca gagaagcagc 18180 

cttccctctc tgagtcctgt ttccttctgt taggtcctga cttcatgggt tgttgttagc 18240 

attaaggaag tcgctggcta attttatagt cattgaagtc agtggtgtgc aacctggttc 18300 

ctcaaaggat cacttccctg aaaaaattcc actgctccct ggaggcttat gcaggccatc 18360 

ccatcccctc cctcttgttg tgttcagctg acagcttttt gctcagtgag taagtgttag 18420 

gtccatttca cagatgggct gcaaccaagt ttgcagtgaa cccactaaga ccagagctag 184 80 

ggccaggact aaatgctggt cccaatgcca cattcccctg tccccacacc acatttcctc 18540 

catccggaga ccctgttacc ccaacccagg gccccattaa ctccctggca gaggccctgt 18600 

tacatctgct gctgccacag cctccgccca cccttcagga ggcagcaggt cccactgctg 18660 

atgataaagt tgcaggctgc ctgagctaat gaaggggctt cctctaggct gtgcacttag 18720 

tcttctgctt ccaaaccaaa tcagaggtga ggcaccctct ctgggcccat ctctctcctc 18780 

cattttcctg ttggggtccc agggaggaag ccacttgcct agggcccagg aattttgcaa 1884 0 

gcctcttgcc ctagggagga aggaagggag gaggatctta ccttgaactg tcaagcctag 18900 

agcctggtgg ggcaggcaga aatgggtgca gtccatgagt tagaaacact agaggagaca 18960 

ctttgctgct tggccggggc aggcaagtta attcccgagg ctcctgccac tgcatctcaa 19020 

tctggaaggt gaccaggtgg ggcaggaccc acgtctccca gatgactcat tttttctaga 19080 

acaggggctt ggctgccaaa gaggatactt gatttcggct tgtggggaca gtggtggacc 19140 

cagcatctgg gctttatata aagggcagct ttgttgccct gtaaacacac agaccatggg 19200 

tggccacttc ttccagtaag ttagctgggg agttggaagt ttaggtaaaa ccttttgatt 19260 

gacaaatgtt ggcgaattac catgctgtta aatgaaacat tgttctgcca ccctggggct 19320 

gtgggtgcct gcgtgcaccc tctgaaaaat cacacaggaa gtggggtggg gtctctgtga 19380 

agctggtgtc ccccagcctc agggatgctg cagaaatgga atgaggacca acagggactc 1944 0 

agatgtccaa ggaagctcta cagcggagag gacggcttgg gaaggaggtc caggcccagg 19500 

tccctccgga acccaatggg tatggggcag cctggctcct gcctcatccc ccttctcctg 19560 

ttgattrtgt cctcacagtg tatgccgccg gcaaagcagc cacctcaggt gttcccagca 19620 

tttatgcccc cagcacctat gcccacctgt ctcccgccaa gaccccaccc ccaccagcta 19680 

tgattcccat gggccctgcc tacaacgggt accctggagg ataccctgga gacgttgaca 1974 0 

ggartagctc aggtgaggcc gggggaagca ggaacagctg gtgggmgtgt gctgggcatc 19800 

tggacactga ggggcagggg ctggaaggaa gagtgtcttg ggagccgagg aggggctctg 19860 

ctcctggtgc gcggccactg acagccactc tcccccagct ggtggccaag gctcctatgt 19920 

acccctgctt cgggacacgg acagcagtgt ggcctctggt gagaatccat cgtcccgaag 19980 

ttggatgtgc ctgtaaggga gaggggtggg ccaggatcca tcctcccaaa ccgaccacca 2004 0 

cccccctgtc cctagaagtc cgcagtggct acaggattca ggccagccag caggacgact 20100 

ccatgcgggt cctgtactac atggagaagg agctggccaa cttcgaccct tctcgacstg 20160 

gcccccccag tggccgtgtg gagcggggta agcaggagcc ttggggtctg agggctttta 20220 

aggtgggggg gtgaaacatg tctccctgat acctgccgca gggactcttg gtgcaaaccc 202 80 

tggaccccgg gctcctccag cagtcagtga cacccccctt ccctgcagcc atgagtgaag 20340 

tcacctccct ccacgaggac gactggcgat ctcggccttc ccggggccct gccctcaccc 20400 

cgatccggga tgaggagtgg ggtggccact ccccccggag tcccagggga tgggaccagg 20460 

agcccgccag ggagcaggca ggcgggggct ggcgggccag gcggccccgg gcccgctccg 2 0520 

tggacgccct ggacgacctc accccgccga gcaccgccga gtcagggagc aggtctccca 20580 

cgagtaatgg tgggaggaga agccgggcct acatgccccc gcggagccgc agccgggacg 20640 

acctctatga ccaagacgac tcgagggact tcccacgctc ccgggacccc cactacgacg 20700 

acttcaggtc tcgggagcgc cctcctgccg accccaggtc ccaccaccac cgtacccggg 20760 

accctcggga caacggctcc aggtccgggg acctccccta tgatgggcgg ctactggagg 20820 

aggctgtgag gaagaagggg tcggaggaga ggaggagacc ccacaaggag gaggaggaag 20880 

aggcctacta cccgcccgcg ccgcccccgt actcggagac cgactcgcag gcgtcccgag 2094 0 

agcgcaggct caagaaggtg agggccgccc tccctggcgt ccagaccgtc cctgggcccc 21000 

cagccggtcc ccgcggctca tacccttctt tctttctccc ttgcagaact tggccctgag 21060 

tcgggaaagt ttagtcgtct gatctgacgt tttctacgta gcttttgkat tttttttttt 21120 

aatttgaagg aacactgatg aagccctgcc atacccctcc cgagtctaat aaaacgtata 21180 

atcacaagct ctggagagaa ccatttgttc ggccgcgcgg ggcgggggac cggggctgct 21240 

cccgtatgcg tctgtaaagc gccgcgtccc gggggcaccg gagtccgggg ccgggaggaa 21300 

gagacccagc ctggcccggc ccgcgcccgc gccgccggcc ggagaacgtg ccccgcgcag 21360 

ccaccgcccg cctgcgtgcg cgccccggcc ccgcccaggc gtgcgcatgc gccccggccc 2142 0 

tccgccttcg cgcaccgcag gctggccgcc gggagcgcgc gcgcgctcct ctccccttcc 21480 

agcccatccc ccccagcccc ccaccgacct actttactgt ctccaaactc gggcagccca 21540 

cctggccccc gacgacccca gcccctgctc cgggtacccc gacgttccat ccagacccgc 21600 

gtttcaccag ggcggcgcgc ggcgacctcg cgccccgcgg agccccgggc tcgcgcgcgc 21660 

ccgcccgccc ccggagacag acagcgcgcg cgctcccggg ccgcctcccc ccagcgcgcg 21720 

tccgccccgg gctcgcgccg ccgccgccgc cgccgccgcg cgcgcgcagc tcaagtaaag 21780 

gaggaaaaaa aaaaggggga aaaatagaaa gcggcggcgg ctgcagcagc gatccgccgc 21840 

cggactgggc caagccgggc ggcggccgcg cgagccggcg atccagggca ctggcggcgg 21900 



WO 01/21647 



24 



PCT/IB00/01470 



ccagccaggg cgggccgtgt tcaaaaaaaa aagtcgcggc ggcggcggct gctcagggaa 21960 

ggaggcctga gggccgcgtg cagcgggcgg gcagctgggt gggctggggg cggccgcgcg 22020 

gcgtcccgga gcctcgggcc gcccggagcc ggcgggcggg cggaggcgga ggcggcggcg 22080 

gctgcagcgg ctgcaggagc ggcggcggct gcngcggcgg cngcggcatc tcctcctcac 22140 

atgaccccac tgtttgtccc cgtgatcagc gcgagcggct cccgtatctc ctccgtcccc 22200 

tcctgccgcg cggcgtgagc gccgggnctc ggggcccccc cggccgcccg ccccctcccc 22260 

tccntccntc ccctcccctc ccctcccccc cgggccccgc gccccccccg cccccgcccc 2232 0 

ccccatggac atgctggacc cgggtctgga tcccgctgcc tcggccaccg ctgctgccgc 22380 

cgccaggtaa gatccccggc ccggccgtgc ccccgcgccc cggccccggc cccggccccg 2244 0 

cggcctgcag gccggggccg ccatgatccc gagcggccgc gggccccgct caaaatggag 22500 

gccgccggcg cgggggggac ctggcgcctc ccgcccccgg cccccggcct cggcggcgcc 22560 

cccggcctca ggcgcggccg ggtgggactg gggccctgca gctgggcgcg ggggcggggg 22620 

cgcgggcgcg ggccgcgctg accctgctcc ctcctgtgcc cctggcagcc acgacaaggg 22680 

acccgaggcg gaggagggcg tcgagctgca ggaaggtgag tgcttgccgg gccggccgcg 2274 0 

cccggggagg gctgggggcg ctcggcgcgg ccctgaccgt gccccgaccc tcctcggccc 22800 

caggcgggga cggcccagga gcggaggagc agacagcggt ggccatcacc agcgtccagc 22860 

aggcggcgtt cggcgaccac aacatccagt accagttccg cacagagaca aatggaggac 22920 

aggtgagcgg cgggccgcga gagcgaacgg gcgggcgggc gggcgcgccg ggaaggctcg 22980 

gacctggccc cagcgccggc ctcgccgctc tgccgccccc tgcaggtgac ataccgcgta 2304 0 

gtccaggtga ctgatggtca gctggacggc cagggcgaca cagctggcgc cgtcagcgtc 23100 

gtgtccaccg ctgccttcgc gggggggcag caggctgtga cccaggtggg tgtggacggg 23160 

gcagcccagc gcccgggccc cgccgct 23187 

<210> 2 

<211> 2158 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> allele 
<222> 595 

<223> 9-3-324 : polymorphic base C or T 
<220> 

<221> allele 
<222> 940 

<223> 9-6-187 : polymorphic base C or T 
<220> 

<221> allele 
<222> 1191 

<223> 9-7-325 : polymorphic base A or G 
<220> 

<22l> allele 
<222> 1362 

<223> 9-9-246 : polymorphic base G or C 
<220> 

<221> allele 
<222> 1658 

<223> LSRX9fl3-BM : polymorphic base deletion of AGG 
<220> 

<221> allele 
<222> 2079 

<223> LSRX9fl4-BM : polymorphic base T or G 
<400> 2 

tggagtgtgg ctcggaggac cgcggcgggt caagcacctt tctcccccat atctgaaagc 60 
atgccctttg tccacgtcgt ttacgctcat taaaacttcc aga atg caa cag gac 115 

Met Gin Gin Asp 

1 
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gga 


Ctt 


gga 


gta 


ggg 


aca 


agg 


aac 


gga 


agt 


ggg 


aag 


ggg 


agg 


age 


gtg 


163 


Gly Leu 


Gly 


Val 


Gly 


Thr 


Arg 


Asn 


Gly 


Ser 


Gly 


Lys 


Gly 


Arg 


Ser 


Val 




5 










10 










15 










20 




cac 


CCC 


tec 


tgg 


cct 


tgg 


tgc 


gcg 


ccg 


cgc 


ccc 


eta 


agg 


tac 


ttt 


gga 


211 


His 


Pro 


Ser 


Trp 


Pro 
25 


Trp 


Cys 


Ala 


Pro 


Arg 
30 


Pro 


Leu 


Arg 


Tyr 


Phe 
35 


Gly 




agg 


gac 


gcg 


egg 


gcc 


aga 


cgc 


gcc 


cag 


acg 


gcc 


gcg 


atg 


gcg 


ctg 


ttg 


259 


Arg Asp 


Ala 


Arg 


Ala 


Arg 


Arg 


Ala 


Gin 


Thr 


Ala 


Ala 


Met 


Ala 


Leu 


Leu 










40 










45 










50 








gcc 


ggc 


ggg 


etc 


tec 


aga 


ggg 


ctg 


ggc 


tec 


cac 


ccg 


gcc 


gcc 


gca 


ggc 


307 


Ala Gly 


Gly 


Leu 


Ser 


Arg 


Gly 


Leu 


Gly 


Ser 


His 


Pro 


Ala 


Ala 


Ala 


Gly 








55 










60 










65 










egg 


gac 


gcg 


gtc 


gtc 


ttc 


gtg 


tgg 


ctt 


ctg 


ctt 


age 


acc 


tgg 


tgc 


aca 


355 


Arg Asp 


Ala 


Val 


Val 


Phe 


Val 


Trp 


Leu 


Leu 


Leu 


Ser 


Thr 


Trp 


Cys 


Thr 






70 










75 










80 












get 


cct 


gcc 


agg 


gcc 


ate 


cag 


gtg 


acc 


gtg 


tec 


aac 


ccc 


tac 


cac 


gtg 


403 


Ala 


Pro 


Ala 


Arg 


Ala 


He 


Gin 


Val 


Thr 


val 


Ser 


Asn 


Pro 


Tyr 


His 


Val 




85 










90 










95 










100 




gtg 


ate 


etc 


ttc 


cag 


cct 


gtg 


acc 


ctg 


ccc 


tgt 


acc 


tac 


cag 


atg 


acc 


451 


Val 


He 


Leu 


Phe 


Gin 
105 


Pro 


Val 


Thr 


Leu 


Pro 
110 


Cys 


Thr 


Tyr 


Gin 


Met 
115 


Thr 




teg 


ace 


ccc 


acg 


caa 


ccc 


ate 


gtc 


ate 


tgg 


aag 


tac 


aag 


tct 


ttc 


tgc 


499 


Ser 


Thr 


Pro 


Thr 
120 


Gin 


Pro 


He 


Val 


He 
125 


Trp 


Lys 


Tyr 


Lys 


Ser 
130 


Phe 


Cys 




egg 


gac 


cgc 


ate 


gcc 


gat 


gcc 


ttc 


tec 


ccg 


gcc 


age 


gtc 


gac 


aac 


cag 


547 


Arg Asp 


Arg 


He 


Ala 


Asp 


Ala 


Phe 


Ser 


Pro 


Ala 


Ser 


Val 


Asp 


Asn 


Gin 








135 










140 










145 










etc 


aat 


gcc 


cag 


ctg 


gca 


gcc 


ggg 


aac 


cca 


ggc 


tac 


aac 


ccc 


tac 


gty 


595 


Leu 


Asn 
150 


Ala 


Gin 


Leu 


Ala 


Ala 
155 


Gly 


Asn 


Pro 


Gly 


Tyr 
160 


Asn 


Pro 


Tyr 


Val 




gag 


tgc 


cag 


gac 


age 


gtg 


cgc 


acc 


gtc 


agg 


gtc 


gtg 


gcc 


acc 


aag 


cag 


643 


Glu 


Cys 


Gin 


Asp 


Ser 


Val 


Arg 


Thr 


Val 


Arg 


Val 


Val 


Ala 


Thr 


Lys 


Gin 




165 










170 










175 










180 




ggc 


aac 


get 


gtg 


acc 


ctg 


gga 


gat 


tac 


tac 


cag 


ggc 


egg 


agg 


att 


acc 


691 


Gly Asn 


Ala 


Val 


Thr 


Leu 


Gly 


Asp 


Tyr 


Tyr 


Gin 


Gly 


Arg 


Arg 


He 


Thr 












185 










190 










195 






ate 


acc 


gga 


aat 


get 


gac 


ctg 


acc 


ttt 


gac 


cag 


acg 


gcg 


tgg 


ggg 


gac 


739 


He 


Thr 


Gly 


Asn 
200 


Ala 


Asp 


Leu 


Thr 


Phe 
205 


Asp 


Gin 


Thr 


Ala 


Trp 
210 


Gly 


Asp 




agt 


ggt 


gtg 


tat 


tac 


tgc 


tec 


gtg 


gtc 


tea 


gcc 


cag 


gac 


etc 


cag 


ggg 


787 


Ser Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 


Val 


Val 


Ser 


Ala 


Gin 


Asp 


Leu 


Gin 


Gly 








215 










220 










225 










aac 


aat 


gag 


gcc 


tac 


gca 


gag 


etc 


ate 


gtc 


ctt 


ggg 


agg 


acc 


tea 


ggg 


835 


Asn 


Asn 
230 


Glu 


Ala 


Tyr 


Ala 


Glu 
235 


Leu 


He 


Val 


Leu 


Gly 
240 


Arg 


Thr 


Ser 


Gly 




gtg 


get 


gag 


etc 


tta 


cct 


ggt 


ttt 


cag 


gcg 


ggg 


ccc 


ata 


gaa 


gac 


tgg 


883 


Val 


Ala 


Glu 


Leu 


Leu 


Pro 


Gly 


Phe 


Gin 


Ala 


Gly 


Pro 


He 


Glu 


Asp 


Trp 




245 










250 










255 










260 




etc 


ttc 


gtg 


gtt 


gtg 


gta 


tgc 


ctg 


get 


gcc 


ttc 


etc 


ate 


ttc 


etc 


etc 


931 


Leu 


Phe 


Val 


Val 


Val 
265 


Val 


Cys 


Leu 


Ala 


Ala 
270 


Phe 


Leu 


He 


Phe 


Leu 
275 


Leu 




ctg 


ggc 


aty 


tgc 


tgg 


tgc 


cag 


tgc 


tgc 


ccg 


cac 


act 


tgc 


tgc 


tgc 


tac 


979 


Leu Gly 


He 


Cys 


Trp 


Cys 


Gin 


Cys 


Cys 


Pro 


His 


Thr 


Cys 


Cys 


Cys 


Tyr 










280 










285 










290 








gtc agg 


tgc 


ccc 


tgc 


tgc 


cca 


gac 


aag 


tgc 


tgc 


tgc 


ccc 


gag 


gcc 


ctg 


1027 


Val 


Arg 


Cys 
295 


Pro 


Cys 


Cys 


Pro 


Asp 
300 


Lys 


Cys 


Cys 


Cys 


Pro 
305 


Glu 


Ala 


Leu 




tat 


gcc 


gcc 


ggc 


aaa 


gca 


gcc 


acc 


tea 


ggt 


gtt 


ccc 


age 


att 


tat 


gcc 


1075 


Tyr Ala 


Ala 


Gly 


Lys 


Ala 


Ala 


Thr 


Ser 


Gly 


Val 


Pro 


Ser 


He 


Tyr 


Ala 






310 










315 










320 












ccc 


age 


acc 


tat 


gcc 


cac 


ctg 


tct 


ccc 


gcc 


aag 


acc 


cca 


ccc 


cca 


cca 


1123 


Pro 


Ser 


Thr 


Tyr 


Ala 


His 


Leu 


Ser 


Pro 


Ala 


Lys 


Thr 


Pro 


Pro 


Pro 


Pro 




325 










330 










335 










340 
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get 


atg 


att 


ccc 


atg 


ggc 


cct 


gec 


tac 


aac 


ggg 


tac 


cct 


gga 


gga 


tac 


1171 


Ala 


Met 


lie 


Pro 


Met 


Gly 


Pro 


Ala 


Tyr 


Asn 


Gly 


Tyr 


Pro 


Gly 


Gly Tyr 












345 










350 










355 






cct 


gga 


gac 


gtt 


gac 


agg 


art 


age 


tea 


get 


ggt 


ggc 


caa 


ggc 


tec 


tat 


1219 


Pro 


Gly 


Asp 


Val 


Asp 


Arg 


Xaa 


Ser 


Ser 


Ala 


Gly 


Gly 


Gin 


Gly 


Ser Tyr 










360 










365 










370 








gta 


ccc 


ctg 


ctt 


egg 


gac 


acg 


gac 


age 


agt 


gtg 


gee 


tct 


gaa 


gtc 


cgc 


1267 


Val 


Pro 


Leu 


Leu 


Arg 


Asp 


Thr 


Asp 


Ser 


Ser 


Val 


Ala 


Ser 


Glu 


Val 


Arg 








375 










380 










385 










agt 


ggc 


tac 


agg 


att 


cag 


gec 


age 


cag 


cag 


gac 


gac 


tec 


atg 


egg gtc 


1315 


Ser 


Gly 


Tyr 


Arg 


lie 


Gin 


Ala 


Ser 


Gin 


Gin 


Asp 


Asp 


Ser 


Met 


Arg Val 






390 










395 










400 












ctg 


tac 


tac 


atg 


gag 


aag 


gag 


ctg 


gee 


aac 


ttc 


gac 


cct 


tct 


cga 


est 


1363 


Leu 


Tyr 


Tyr 


Met 


Glu 


Lys 


Glu 


Leu 


Ala 


Asn 


Phe 


Asp 


Pro 


Ser 


Arg 


Xaa 




405 










410 










415 










420 




ggc 


ccc 


ccc 


agt 


ggc 


cgt 


gtg 


gag 


egg 


gec 


atg 


agt 


gaa 


gtc 


acc 


tec 


1411 


Gly 


Pro 


Pro 


Ser 


Gly 


Arg 


Val 


Glu 


Arg 


Ala 


Met 


Ser 


Glu 


Val 


Thr 


Ser 












425 










430 










435 






etc 


cac 


gag 


gac 


gac 


tgg 


cga 


tct 


egg 


cct 


tec 


egg 


ggc 


cct 


gee 


etc 


1459 


Leu 


His 


Glu 


Asp 


Asp 


Trp 


Arg 


Ser 


Arg 


Pro 


Ser 


Arg 


Gly 


Pro 


Ala 


Leu 










440 










445 










450 








acc 


ccg 


ate 


egg 


gat 


gag 


gag 


tgg 


ggt 


ggc 


cac 


tec 


ccc 


egg 


agt 


ccc 


1507 


Thr 


Pro 


He 


Arg 


Asp 


Glu 


Glu 


Trp 


Gly 


Gly 


His 


Ser 


Pro 


Arg 


Ser 


Pro 








455 










460 










465 










agg 


gga 


tgg 


gac 


cag 


gag 


ccc 


gee 


agg 


gag 


cag 


gca 


ggc 


ggg 


ggc 


tgg 


1555 


Arg 


Gly 


Trp 


Asp 


Gin 


Glu 


Pro 


Ala 


Arg 


Glu 


Gin 


Ala 


Gly 


Gly 


Gly Trp 






470 










475 










480 












egg 


gee 


agg 


egg 


ccc 


egg 


gee 


cgc 


tec 


gtg 


gac 


gec 


ctg 


gac 


gac 


etc 


1603 


Arg 


Ala 


Arg 


Arg 


Pro 


Arg 


Ala 


Arg 


Ser 


Val 


Asp 


Ala 


Leu 


Asp 


Asp 


Leu 




485 










490 










495 










500 




acc 


ccg 


ccg 


age 


acc 


gee 


gag 


tea 


ggg 


age 


agg 


tct 


ccc 


acg 


agt 


aat 


1651 


Thr 


Pro 


Pro 


Ser 


Thr 


Ala 


Glu 


Ser 


Gly 


Ser 


Arg 


Ser 


Pro 


Thr 


Ser 


Asn 












505 










510 










515 






ggt 


ggg 


aga 


age 


egg 


gee 


tac 


atg 


ccc 


ccg 


egg 


age 


cgc 


age 


egg 


gac 


1699 


Gly 


Gly 


Arg 


Ser 


Arg 


Ala 


Tyr 


Met 


Pro 


Pro 


Arg 


Ser 


Arg 


Ser 


Arg 


Asp 










520 










525 










530 








gac 


etc 


tat 


gac 


caa 


gac 


gac 


teg 


agg 


gac 


ttc 


cca 


cgc 


tec 


egg 


gac 


1747 


Asp 


Leu 


Tyr 


Asp 


Gin 


Asp 


Asp 


Ser 


Arg 


Asp 


Phe 


Pro 


Arg 


Ser 


Arg 


Asp 








535 










540 










545 










ccc 


cac 


tac 


gac 


gac 


ttc 


agg 


tct 


egg 


gag 


cgc 


cct 


cct 


gec 


gac 


ccc 


1795 


Pro 


His 


Tyr 


Asp 


Asp 


Phe 


Arg 


Ser 


Arg 


Glu 


Arg 


Pro 


Pro 


Ala 


Asp 


Pro 






550 










555 










560 












agg 


tec 


cac 


cac 


cac 


cgt 


acc 


egg 


gac 


cct 


egg 


gac 


aac 


ggc 


tec 


agg 


1843 


Arg 


Ser 


His 


His 


His 


Arg 


Thr 


Arg 


Asp 


Pro 


Arg 


Asp 


Asn 


Gly 


Ser 


Arg 




565 










570 










575 










580 




tec 


ggg 


gac 


etc 


ccc 


tat 


gat 


ggg 


egg 


eta 


ctg 


gag 


gag 


get 


gtg 


agg 


1891 


Ser 


Gly 


Asp 


Leu 


Pro 


Tyr 


Asp 


Gly 


Arg 


Leu 


Leu 


Glu 


Glu 


Ala 


Val 


Arg 












585 










590 










595 






aag 


aag 


ggg 


teg 


gag 


gag 


agg 


agg 


aga 


CCC 


cac 


aag 


gag 


gag 


gag 


gaa 


1939 


Lys 


Lys 


Gly 


Ser 


Glu 


Glu 


Arg 


Arg 


Arg 


Pro 


His 


Lys 


Glu 


Glu 


Glu 


Glu 










600 










605 










610 








gag 


gee 


tac 


tac 


ccg 


ccc 


gcg 


ccg 


ccc 


ccg 


tac 


teg 


gag 


acc 


gac 


teg 


1987 


Glu 


Ala 


Tyr 


Tyr 


Pro 


Pro 


Ala 


Pro 


Pro 


Pro 


Tyr 


Ser 


Glu 


Thr 


Asp 


Ser 








615 










620 










625 










cag 


gcg 


tec 


cga 


gag 


cgc 


agg 


etc 


aag 


aag 


aac 


ttg 


gee 


ctg 


agt 


egg 


2035 


Gin 


Ala 


Ser 


Arg 


Glu 


Arg 


Arg 


Leu 


Lys 


Lys 


Asn 


Leu 


Ala 


Leu 


Ser Arg 





630 635 640 

gaa agt tta gtc gtc tga tctgacgttt tetaegtage ttttgkattt 2083 
Glu Ser Leu Val Val * 
645 650 

ttttttttaa tttgaaggaa cactgatgaa gccctgccat acccctcccg agtctaataa 2143 
aacgtataat cacaa 2158 
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<210> 3 
<211> 649 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> VARIANT 

<222> 363 

<223> 9-7-325 : polymorphic amino acid Ser or Asn 
<220> 

<221> VARIANT 

<222> 420 

<223> 9-9-246 : polymorphic amino acid Pro or Arg 
<220> 

<221> VARIANT 

<222> 519 

<223> LSRX9fl3-BM : polymorphic amino acid deletion of Arg 



<400> 3 



Met Gin 


Gin 


Asp 


Gly 


Leu 


Gly 


Val 


Gly 


Thr 


Arg 


Asn 


Gly 


Ser 


Gly 


Lys 


1 






5 










10 










15 




Gly Arg 


Ser 


Val 
20 


His 


Pro 


Ser 


Trp 


Pro 
25 


Trp 


Cys 


Ala 


Pro 


Arg 
30 


Pro 


Leu 


Arg Tyr 


Phe 
35 


Gly 


Arg 


Asp 


Ala 


Arg 
40 


Ala 


Arg 


Arg 


Ala 


Gin 
45 


Thr 


Ala 


Ala 


Met Ala 


Leu 


Leu 


Ala 


Gly 


Gly 


Leu 


Ser 


Arg 


Gly 


Leu 


Gly 


Ser 


His 


Pro 


50 










55 










60 










Ala Ala 


Ala 


Gly 


Arg 


Asp 


Ala 


Val 


Val 


Phe 


Val 


Trp 


Leu 


Leu 


Leu 


Ser 


65 








70 










75 










60 


Thr Trp 


Cys 


Thr 


Ala 
85 


Pro 


Ala 


Arg 


Ala 


He 
90 


Gin 


Val 


Thr 


Val 


Ser 
95 


Asn 


Pro Tyr 


His 


Val 
100 


val 


He 


Leu 


Phe 


Gin 
105 


Pro 


Val 


Thr 


Leu 


Pro 
110 


Cys 


Thr 


Tyr Gin 


Met 
115 


Thr 


Ser 


Thr 


Pro 


Thr 
120 


Gin 


Pro 


He 


Val 


He 
125 


Trp 


Lys 


Tyr 


Lys Ser 


Phe 


Cys 


Arg 


Asp 


Arg 


He 


Ala 


Asp 


Ala 


Phe 


Ser 


Pro 


Ala 


Ser 


130 










135 










140 










Val Asp 


Asn 


Gin 


Leu 


Asn 


Ala 


Gin 


Leu 


Ala 


Ala 


Gly 


Asn 


Pro 


Gly 


Tyr 


145 








150 










155 










160 


Asn Pro 


Tyr 


Val 


Glu 
165 


Cys 


Gin 


Asp 


Ser 


Val 
170 


Arg 


Thr 


val 


Arg 


Val 
175 


Val 


Ala Thr 


Lys 


Gin 
180 


Gly 


Asn 


Ala 


Val 


Thr 
185 


Leu 


Gly 


Asp 


Tyr 


Tyr 
190 


Gin 


Gly 


Arg Arg 


He 
195 


Thr 


He 


Thr 


Gly 


Asn 
200 


Ala 


Asp 


Leu 


Thr 


Phe 
205 


Asp 


Gin 


Thr 


Ala Trp 


Gly 


Asp 


Ser 


Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 


Val 


Val 


Ser 


Ala 


Gin 


210 










215 










220 










Asp Leu 


Gin 


Gly 


Asn 


Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 


Leu 


He 


Val 


Leu 


Gly 


225 








230 










235 










240 


Arg Thr 


Ser 


Gly 


Val 
245 


Ala 


Glu 


Leu 


Leu 


Pro 
250 


Gly 


Phe 


Gin 


Ala 


Gly 
255 


Pro 


He Glu 


Asp 


Trp 
260 


Leu 


Phe 


Val 


Val 


Val 
265 


Val 


Cys 


Leu 


Ala 


Ala 
270 


Phe 


Leu 


He Phe 


Leu 
275 


Leu 


Leu 


Gly 


He 


Cys 
280 


Trp 


Cys 


Gin 


Cys 


Cys 
285 


Pro 


His 


Thr 


Cys Cys 


Cys 


Tyr 


Val 


Arg 


Cys 


Pro 


Cys 


Cys 


Pro 


Asp 


Lys 


Cys 


Cys 


Cys 


290 










295 










300 










Pro Glu 


Ala 


Leu 


Tyr 


Ala 


Ala 


Gly 


Lys 


Ala 


Ala 


Thr 


Ser 


Gly 


Val 


Pro 


305 








310 










315 










320 


Ser He 


Tyr 


Ala 


Pro 
325 


Ser 


Thr 


Tyr 


Ala 


His 
330 


Leu 


Ser 


Pro 


Ala 


Lys 
335 


Thr 
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Pro 


Pro 


Pro Pro 


Ala 


Met 


lie 


Pro 


Met 


Gly 


Pro 


Ala Tyr Asn 


Gly Tyr 






340 










345 






350 






Pro 


Gly 


Gly Tyr 


Pro 


Gly 


Asp 


Val 


Asp 


Arg 


Ser 


Ser Ser Ala 


Gly Gly 






355 








360 








365 






Gin 


Gly 


Ser Tyr 


Val 


Pro 


Leu 


Leu 


Arg 


Asp 


Thr 


Asp Ser Ser 


Val 


Ala 




370 








375 










380 






Ser 


Glu 


Val Arg 


Ser 


Gly 


Tyr 


Arg 


He 


Gin 


Ala 


Ser Gin Gin 


Asp 


Asp 


385 








390 










395 






400 


Ser 


Met 


Arg Val 


Leu 


Tyr 


Tyr 


Met 


Glu 


Lys 


Glu 


Leu Ala Asn 


Phe 


Asp 








405 










410 






415 




Pro 


Ser 


Arg Pro 


Gly 


Pro 


Pro 


Ser 


Gly 


Arg 


Val 


Glu Arg Ala 


Met 


Ser 






420 










425 






430 






Glu 


Val 


Thr Ser 


Leu 


His 


Glu 


Asp 


Asp 


Trp 


Arg 


Ser Arg Pro 


Ser 


Arg 






435 








440 








445 






Gly 


Pro 


Ala Leu 


Thr 


Pro 


He 


Arg 


Asp 


Glu 


Glu 


Trp Gly Gly 


His 


Ser 




450 








455 










460 






Pro 


Arg 


Ser Pro 


Arg 


Gly 


Trp 


Asp 


Gin 


Glu 


Pro 


Ala Arg Glu 


Gin 


Ala 


465 








470 










475 






480 


Gly 


Gly 


Gly Trp 


Arg 


Ala 


Arg 


Arg 


Pro 


Arg 


Ala 


Arg Ser Val 


Asp 


Ala 








485 










490 






495 




Leu 


Asp 


Asp Leu 


Thr 


Pro 


Pro 


Ser 


Thr 


Ala 


Glu 


Ser Gly Ser 


Arg 


Ser 






500 










505 






510 






Pro 


Thr 


Ser Asn 


Gly 


Gly 


Arg 


Ser 


Arg 


Ala 


Tyr 


Met Pro Pro 


Arg 


Ser 






515 








520 








525 






Arg 


Ser 


Arg Asp 


Asp 


Leu 


Tyr 


Asp 


Gin 


Asp 


Asp 


Ser Arg Asp 


Phe 


Pro 




530 








535 










540 






Arg 


Ser 


Arg Asp 


Pro 


His 


Tyr 


Asp 


Asp 


Phe 


Arg 


Ser Arg Glu 


Arg 


Pro 


545 








550 










555 






560 


Pro 


Ala 


Asp Pro 


Arg 


Ser 


His 


His 


His 


Arg 


Thr 


Arg Asp Pro 


Arg Asp 








565 










570 






575 




Asn 


Gly 


Ser Arg 


Ser 


Gly 


Asp 


Leu 


Pro 


Tyr 


Asp 


Gly Arg Leu 


Leu 


Glu 






580 










585 






590 






Glu 


Ala 


Val Arg 


Lys 


Lys 


Gly 


Ser 


Glu 


Glu 


Arg 


Arg Arg Pro 


His 


Lys 






595 








600 








605 






Glu 


Glu 


Glu Glu 


Glu 


Ala 


Tyr 


Tyr 


Pro 


Pro 


Ala 


Pro Pro Pro 


Tyr 


Ser 




610 








615 










620 






Glu 


Thr 


Asp Ser 


Gin 


Ala 


Ser 


Arg 


Glu 


Arg 


Arg 


Leu Lys Lys 


Asn 


Leu 


625 








630 










635 






640 


Ala 


Leu 


Ser Arg 


Glu 


Ser 


Leu 


Val 


Val 













645 



<210> 4 

<211> 2101 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 595 

<223> 9-3-324 : polymorphic base C or T 
<220> 

<221> allele 
<222> 883 

<223> 9-6-187 : polymorphic base C or T 
<220> 

<22l> allele 
<222> 1134 

<223> 9-7-325 : polymorphic base A or G 
<220> 

<221> allele 
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<222> 1305 
<223> 9-9-246 



polymorphic base G or C 



<220> 

<221> allele 

<222> 1601 

<223> LSRX9fl3-BM 



polymorphic base deletion of AGG 



<220> 

<221> allele 

<222> 2022 

<223> LSRX9fl4-BM 



polymorphic base T or G 



<400> 4 

tggagtgtgg ctcggaggac cgcggcgggt caagcacctt tctcccccat atctgaaagc 60 



atgccctttg tccacgtcgt ttacgctcat taaaacttcc 


aga 


atg 


caa 


cag 


gac 


115 
























Met 
1 


Gin 


Gin 


Asp 




aaa ctt 


QQa 


qta 




aca 


agg 

**33 


aac 


gga 


agt 


ggg 


aag 


ggg 


agg 


age 


gtg 


163 


Gly Leu 


Gly 


Val 


Gly 


Thr 


Arg 


Asn 


Gly 


Ser 


Gly 


Lys 


Gly 


Arg 


Ser 


Val 




5 








10 










15 










20 






tec 


tqq 


cct 


tQQ 


tgc 


gcg 

3 3 


ccg 


cgc 


ccc 


eta 


agg 


tac 


ttt 


gga 


211 


His Pro 


Ser 


Trp 


Pro 


Trp 


Cys 


Ala 


Pro 


Arg 


Pro 


Leu 


Arg 


Tyr 


Phe 


Gly 










25 










30 










35 






aQQ gac 


qCQ 


egg 


gcc 


aga 


cgc 


gcc 


cag 


acg 


gcc 


gcg 


atg 


gcg 


ctg 


ttg 


259 


Arg Asp 


Ala 


Arg 


Ala 


Arg 


Arg 


Ala 


Gin 


Thr 


Ala 


Ala 


Met 


Ala 


Leu 


Leu 








40 










45 










50 








gcc ggc 


999 


etc 


tec 


aga 


ggg 


ctg 


ggc 


tec 


cac 


ccg 


gcc 


gcc 


gca 


ggc 


307 


Ala Gly 


Gly 


Leu 


Ser 


Arg 


Gly 


Leu 


Gly 


Ser 


His 


Pro 


Ala 


Ala 


Ala 


Gly 






55 










60 










65 










egg gac 


gcg 


gtc 


gtc 


ttc 


gtg 


tgg 


ctt 


ctg 


ctt 


age 


acc 


tgg 


tgc 


aca 


355 


Arg Asp 


Ala 


Val 


Val 


Phe 


Val 


Trp 


Leu 


Leu 


Leu 


Ser 


Thr 


Trp 


Cys 


Thr 




70 










75 










80 












get cct 


gcc 


agg 


gcc 


ate 


cag 


gtg 


acc 


gtg 


tec 


aac 


ccc 


tac 


cac 


gtg 


403 


Ala Pro 


Ala 


Arg 


Ala 


He 


Gin 


Val 


Thr 


Val 


Ser 


Asn 


Pro 


Tyr 


His 


Val 




85 








90 










95 










100 




gtg ate 


etc 


ttc 


cag 


cct 


gtg 


acc 


ctg 


ccc 


tgt 


acc 


tac 


cag 


atg 


acc 


451 


Val He 


Leu 


Phe 


Gin 


Pro 


Val 


Thr 


Leu 


Pro 


Cys 


Thr 


Tyr 


Gin 


Met 


Thr 










105 










110 










115 






teg acc 


ccc 


acg 


caa 


ccc 


ate 


gtc 


ate 


tgg 


aag 


tac 


aag 


tct 


ttc 


tgc 


499 


Ser Thr 


Pro 


Thr 


Gin 


Pro 


He 


Val 


He 


Trp 


Lys 


Tyr 


Lys 


Ser 


Phe 


Cys 








120 










125 










130 








egg gac 


cgc 


ate 


gcc 


gat 


gcc 


ttc 


tec 


ccg 


gcc 


age 


gtc 


gac 


aac 


cag 


547 


Arg Asp 


Arg 


He 


Ala 


Asp 


Ala 


Phe 


Ser 


Pro 


Ala 


Ser 


Val 


Asp 


Asn 


Gin 






135 










140 










145 










etc aat 


gcc 


cag 


ctg 


gca 


gcc 


ggg 


aac 


cca 


ggc 


tac 


aac 


ccc 


tac 


gty 


595 


Leu Asn 


Ala 


Gin 


Leu 


Ala 


Ala 


Gly 


Asn 


Pro 


Gly 


Tyr 


Asn 


Pro 


Tyr 


Val 




150 










155 










160 












gag tgc 


cag 


gac 


age 


gtg 


cgc 


acc 


gtc 


agg 


gtc 


gtg 


gcc 


acc 


aag 


cag 


643 


Glu Cys 


Gin 


Asp 


Ser 


Val 


Arg 


Thr 


Val 


Arg 


Val 


Val 


Ala 


Thr 


Lys 


Gin 




165 








170 










175 










180 




ggc aac 


get 


gtg 


acc 


ctg 


gga 


gat 


tac 


tac 


cag 


ggc 


egg 


agg 


att 


acc 


691 


Gly Asn 


Ala 


Val 


Thr 


Leu 


Gly 


Asp 


Tyr 


Tyr 


Gin 


Gly 


Arg 


Arg 


He 


Thr 










185 










190 










195 






ate acc 


gga 


aat 


get 


gac 


ctg 


acc 


ttt 


gac 


cag 


acg 


gcg 


tgg 


ggg 


gac 


739 


He Thr 


Gly 


Asn 


Ala 


Asp 


Leu 


Thr 


Phe 


Asp 


Gin 


Thr 


Ala 


Trp 


Gly 


Asp 








200 










205 










210 








agt ggt 


gtg 


tat 


tac 


tgc 


tec 


gtg 


gtc 


tea 


gcc 


cag 


gac 


etc 


cag 


ggg 


787 


Ser Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 


Val 


Val 


Ser 


Ala 


Gin 


Asp 


Leu 


Gin 


Gly 






215 










220 










225 










aac aat 


gag 


gcc 


tac 


gca 


gag 


etc 


ate 


gtc 


ctt 


gac 


tgg 


etc 


ttc 


gtg 


835 


Asn Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 


Leu 


He 


Val 


Leu 


Asp 


Trp 


Leu 


Phe 


Val 




230 










235 










240 
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gtt gtg 


gta 


tgc 


ctg 


get 


Val Val 


Val 


Cys 


Leu 


Ala 


245 








250 


tgc tgg 


tgc 


cag 


tgc 


tgc 


Cys Trp 


Cys 


Gin 


Cys 


Cys 








265 




ccc tgc 


tgc 


cca 


gac 


aag 


Pro Cys 


Cys 


Pro 


Asp 


Lys 






280 






ggc aaa 


gca 


gcc 


acc 


tea 


Gly Lys 


Ala 


Ala 


Thr 


Ser 




295 








tat gcc 


cac 


ctg 


tct 


ccc 


Tyr Ala 


His 


Leu 


Ser 


Pro 


310 










ccc atg 


ggc 


cct 


gcc 


tac 


Pro Met 


Gly 


Pro 


Ala 


Tyr 


325 








330 


gtt gac 


agg 


art 


age 


tea 


Val Asp 


Arg 


Xaa 


Ser 


Ser 








345 




ctt egg 


gac 


acg 


gac 


age 


Leu Arg 


Asp 


Thr 


Asp 


Ser 






360 






agg att 


cag 


gcc 


age 


cag 


Arg lie 


Gin 


Ala 


Ser 


Gin 




375 








atg gag 


aag 


gag 


ctg 


gcc 


Met Glu 


Lys 


Glu 


Leu 


Ala 


390 










agt ggc 


cgt 


gtg 


gag 


egg 


Ser Gly 


Arg 


Val 


Glu 


Arg 


405 








410 


gac gac 


tgg 


cga 


tct 


egg 


Asp Asp 


Trp 


Arg 


Ser 


Arg 








425 






gag 


gag 


tgg 


ggt 


Arg Asp 


Glu 


Glu 


Trp 


Gly 






440 






gac cag 


gag 


ccc 


gcc 


agg 


Asp Gin 


Glu 


Pro 


Ala 


Arg 




455 








egg ccc 


egg 


gcc 


cgc 


tec 


Arg Pro 


Arg 


Ala 


Arg 


Ser 


470 










age acc 


gcc 


gag 


tea 


ggg 


Ser Thr 


Ala 


Glu 


Ser 


Gly 


485 








490 


age egg 


gcc 


tac 


atg 


ccc 


Ser Arg 


Ala 


Tyr 


Met 


Pro 








505 




gac caa 


gac 


gac 


teg 


agg 


Asp Gin 


Asp 


Asp 


Ser 


Arg 






520 






gac gac 


ttc 


agg 


tct 


egg 


Asp Asp 


Phe 


Arg 


Ser 


Arg 




535 








cac cac 


cgt 


acc 


egg 


gac 


His His 


Arg 


Thr 


Arg 


Asp 


550 










etc ccc 


tat 


gat 


ggg 


egg 


Leu Pro 


Tyr 


Asp 


Gly 


Arg 


565 








570 



30 

gcc ttc etc ate ttc etc 
Ala Phe Leu lie Phe Leu 
255 

ccg cac act tgc tgc tgc 
Pro His Thr Cys Cys Cys 
270 

tgc tgc tgc ccc gag gcc 
Cys Cys Cys Pro Glu Ala 
285 

ggt gtt ccc age att tat 
Gly Val Pro Ser lie Tyr 
300 

gcc aag acc cca ccc cca 
Ala Lys Thr Pro Pro Pro 
315 320 
aac ggg tac cct gga gga 
Asn Gly Tyr Pro Gly Gly 
335 

get ggt ggc caa ggc tec 
Ala Gly Gly Gin Gly Ser 
350 

agt gtg gcc tct gaa gtc 
Ser Val Ala Ser Glu Val 
365 

cag gac gac tec atg egg 
Gin Asp Asp Ser Met Arg 
380 

aac ttc gac cct tct cga 
Asn Phe Asp Pro Ser Arg 
395 400 
gcc atg agt gaa gtc acc 
Ala Met Ser Glu Val Thr 
415 

cct tec egg ggc cct gcc 
Pro Ser Arg Gly Pro Ala 
430 

ggc cac tec ccc egg agt 
Gly His Ser Pro Arg Ser 
445 

gag cag gca ggc ggg ggc 
Glu Gin Ala Gly Gly Gly 
460 

gtg gac gcc ctg gac gac 
Val Asp Ala Leu Asp Asp 
475 480 
age agg tct ccc acg agt 
Ser Arg Ser Pro Thr Ser 
495 

ccg egg age cgc age egg 
Pro Arg Ser Arg Ser Arg 
510 

gac ttc cca cgc tec egg 
Asp Phe Pro Arg Ser Arg 
525 

gag cgc cct cct gcc gac 
Glu Arg Pro Pro Ala Asp 
540 

cct egg gac aac ggc tec 
Pro Arg Asp Asn Gly Ser 
555 560 
eta ctg gag gag get gtg 
Leu Leu Glu Glu Ala Val 
575 
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etc 


ctg 


ggc 


aty 


883 


Leu 


Leu 


Gly 


lie 










260 




tac 


gtc 


agg 


tgc 


931 


Tyr 


Val 


Arg 


Cys 








275 






ctg 


tat 


gcc 


gcc 


979 


Leu 


Tvr 


Ala 


Ala 






290 








gcc 


ccc 


age 


acc 


1027 


Ala 


Pro 


Ser 


Thr 




305 










cca 


get 


atg 


att 


1075 


Pro 


Ala 


Met 


He 




tac 


cct 


gga 


gac 


1123 


Tyr 


Pro 


Gly 


Asp 










340 




tat 


gta 


ccc 


ctg 


1171 


Tyr 


Val 


Pro 


Leu 








355 






cgc 


agt 


ggc 


tac 


1219 


Arg 


OCX 




TVr 






370 








gtc 


ctg 


tac 


tac 


1267 


Val 


Leu 


Tyr 


Tyr 




385 










est 


ggc 


ccc 


ccc 


1315 


Xaa 


Gly 


Pro 


Pro 




tec 


etc 


cac 


gag 


1363 


Ser 


Leu 


His 


Glu 










420 




etc 


acc 


ccg 


ate 


1411 


Leu 


Thr 


Pro 


He 








435 






ccc 


agg 


gga 


tgg 


1459 




Arg 




irp 






450 








tgg 


egg 


gcc 


agg 


1507 


Trp 


Arg 


Ala 


Arg 




465 










etc 


acc 


ccg 


ccg 


1555 


Leu 


Thr 


Pro 


Pro 




aat 


ggt 


ggg 


aga 


1603 


Asn 


Gly 


Gly 


Arg 










500 




gac 


gac 


etc 


tat 


1651 


Asp 


Asp 


Leu 


Tyr 








515 






gac 


ccc 


cac 


tac 


1699 


Asp 


Pro 


His 


Tyr 






530 








ccc 


agg 


tec 


cac 


1747 


Pro 


Arg 


Ser 


His 




545 










agg 


tec 


ggg 


gac 


1795 


Arg 


Ser 


Gly 


Asp 




agg 


aag 


aag 


ggg 


1843 


Arg 


Lys 


Lys 


Gly 





580 
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teg gag gag agg agg aga ccc cac aag gag gag gag gaa gag gec tac 18 91 

Ser Glu Glu Arg Arg Arg Pro His Lys Glu Glu Glu Glu Glu Ala Tyr 

585 590 595 

tac ccg ccc gcg ccg ccc ccg tac teg gag acc gac teg cag gcg tec 1939 
Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gin Ala Ser 

600 605 610 

cga gag cgc agg etc aag aag aac ttg gee ctg agt egg gaa agt tta 1987 
Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu 

615 620 625 

gtc gtc tga tctgacgttt tetaegtage ttttgkattt ttttttttaa 2036 
Val Val * 
630 

tttgaaggaa cactgatgaa gccctgccat acccctcccg agtctaataa aacgtataat 2096 
cacaa 2101 



<210> 5 

<211> 630 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> VARIANT 

<222> 344 

<223> 9-7-325 : polymorphic amino acid Ser or Asn 



<220> 

<221> VARIANT 

<222> 401 

<223> 9-9-246 : polymorphic amino acid Pro or Arg 



<220> 

<221> VARIANT 

<222> 500 

<223> LSRX9fl3-BM : polymorphic amino acid deletion of Arg 



<400> 5 



Met 


Gin 


Gin 


Asp 


Gly 


Leu 


Gly 


Val 


Gly 


Thr 


Arg 


Asn 


Gly 


Ser 


Gly 


Lys 


1 








5 










10 










15 




Gly 


Arg 


Ser 


Val 


His 


Pro 


Ser 


Trp 


Pro 


Trp 


Cys 


Ala 


Pro 


Arg 


Pro 


Leu 








20 










25 










30 






Arg 


Tyr 


Phe 


Gly 


Arg 


Asp 


Ala 


Arg 


Ala 


Arg 


Arg 


Ala 


Gin 


Thr 


Ala 


Ala 






35 










40 










45 








Met 


Ala 


Leu 


Leu 


Ala 


Gly 


Gly 


Leu 


Ser 


Arg 


Gly 


Leu 


Gly 


Ser 


His 


Pro 




50 










55 










60 










Ala 


Ala 


Ala 


Gly 


Arg 


Asp 


Ala 


Val 


Val 


Phe 


Val 


Trp 


Leu 


Leu 


Leu 


Ser 


65 










70 










75 










80 


Thr 


Trp 


Cys 


Thr 


Ala 


Pro 


Ala 


Arg 


Ala 


He 


Gin 


Val 


Thr 


Val 


Ser 


Asn 










85 










90 










95 




Pro 


Tyr 


His 


Val 


Val 


He 


Leu 


Phe 


Gin 


Pro 


Val 


Thr 


Leu 


Pro 


Cys 


Thr 








100 










105 










110 






Tyr 


Gin 


Met 


Thr 


Ser 


Thr 


Pro 


Thr 


Gin 


Pro 


He 


Val 


He 


Trp 


Lys 


Tyr 






115 










120 










125 








Lys 


Ser 


Phe 


Cys 


Arg 


Asp 


Arg 


He 


Ala 


Asp 


Ala 


Phe 


Ser 


Pro 


Ala 


Ser 




130 










135 










140 










Val 


Asp 


Asn 


Gin 


Leu 


Asn 


Ala 


Gin 


Leu 


Ala 


Ala 


Gly 


Asn 


Pro 


Gly 


Tyr 


145 










150 










155 










160 


Asn 


Pro 


Tyr 


Val 


Glu 


Cys 


Gin 


Asp 


Ser 


Val 


Arg 


Thr 


Val 


Arg 


Val 


Val 










165 










170 










175 




Ala 


Thr 


Lys 


Gin 


Gly 


Asn 


Ala 


Val 


Thr 


Leu 


Gly 


Asp 


Tyr 


Tyr 


Gin 


Gly 








180 










185 










190 






Arg 


Arg 


lie 


Thr 


He 


Thr 


Gly 


Asn 


Ala 


Asp 


Leu 


Thr 


Phe 


Asp 


Gin 


Thr 






195 










200 










205 








Ala 


Trp 


Gly 


Asp 


Ser 


Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 


Val 


Val 


Ser 


Ala 


Gin 
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210 






215 


220 










Asp 


Leu 


Gin Gly 


Asn Asn 


Glu Ala Tyr Ala 


Glu Leu 


lie 


Val 


Leu 


Asp 


225 






230 




235 








240 


Trp 


Leu 


Phe Val 


Val Val 


Val Cys Leu Ala 


Ala Phe 


Leu 


He 


Phe 


Leu 






245 


250 








255 




Leu 


Leu 


Gly He 


Cys Trp 


Cys Gin Cys Cys 


Pro His 


Thr 


Cys 


Cys 


Cys 






260 




265 






270 






Tyr 


Val 


Arg Cys 


Pro Cys 


Cys Pro Asp Lys 


Cys Cys 


Cys 


Pro 


Glu 


Ala 






275 




280 




285 








Leu 


Tyr 


Ala Ala 


Gly Lys 


Ala Ala Thr Ser 


Gly Val 


Pro 


Ser 


He 


Tyr 




290 






295 


300 










Ala 


Pro 


Ser Thr 


Tyr Ala 


His Leu Ser Pro 


Ala Lys 


Thr 


Pro 


Pro 


Pro 


305 






310 




315 








320 


Pro 


Ala 


Met He 


Pro Met 


Gly Pro Ala Tyr 


Asn Gly 


Tyr 


Pro 


Gly 


Gly 








325 


330 








335 




Tyr 


Pro 


Gly Asp 


Val Asp 


Arg Ser Ser Ser 


Ala Gly 


Gly 


Gin 


Gly 


Ser 






340 




345 






350 






Tyr 


Val 


Pro Leu 


Leu Arg 


Asp Thr Asp Ser 


Ser Val 


Ala 


Ser 


Glu 


Val 






355 




360 




365 








Arg 


Ser 


Gly Tyr 


Arg He 


Gin Ala Ser Gin 


Gin Asp 


Asp 


Ser 


Met 


Arg 




370 






375 


380 










Val 


Leu 


Tyr Tyr 


Met Glu 


Lys Glu Leu Ala 


Asn Phe 


Asp 


Pro 


Ser 


Arg 


385 






390 




395 








400 


Pro 


Gly 


Pro Pro 


Ser Gly 


Arg Val Glu Arg 


Ala Met 


Ser 


Glu 


Val 


Thr 








405 


410 








415 




Ser 


Leu 


His Glu 


Asp Asp 


Trp Arg Ser Arg 


Pro Ser 


Arg 


Gly 


Pro 


Ala 






420 




425 






430 






Leu 


Thr 


Pro He 


Arg Asp 


Glu Glu Trp Gly 


Gly His 


Ser 


Pro 


Arg 


Ser 






435 




440 




445 








Pro 


Arg 


Gly Trp 


Asp Gin 


Glu Pro Ala Arg 


Glu Gin 


Ala 


Gly 


Gly 


Gly 




450 






455 


460 










Trp 


Arg 


Ala Arg 


Arg Pro 


Arg Ala Arg Ser 


Val Asp 


Ala 


Leu 


Asp 


Asp 


465 






470 




475 








480 


Leu 


Thr 


Pro Pro 


Ser Thr 


Ala Glu Ser Gly 


Ser Arg 


Ser 


Pro 


Thr 


Ser 








485 


490 








495 




Asn 


Gly 


Gly Arg 


Ser Arg 


Ala Tyr Met Pro 


Pro Arg 


Ser 


Arg 


Ser 


Arg 






500 




505 






510 






Asp 


Asp 


Leu Tyr 


Asp Gin 


Asp Asp Ser Arg 


Asp Phe 


Pro 


Arg 


Ser 


Arg 






515 




520 




525 








Asp 


Pro 


His Tyr 


Asp Asp 


Phe Arg Ser Arg 


Glu Arg 


Pro 


Pro 


Ala 


Asp 




530 






535 


540 










Pro 


Arg 


Ser His 


His His 


Arg Thr Arg Asp 


Pro Arg 


Asp 


Asn 


Gly 


Ser 


545 






550 




555 








560 


Arg 


Ser 


Gly Asp 


Leu Pro 


Tyr Asp Gly Arg 


Leu Leu 


Glu 


Glu 


Ala 


Val 








565 


570 








575 




Arg 


Lys 


Lys Gly 


Ser Glu 


Glu Arg Arg Arg 


Pro His 


Lys 


Glu 


Glu 


Glu 






580 




585 






590 






Glu 


Glu 


Ala Tyr 


Tyr Pro 


Pro Ala Pro Pro 


Pro Tyr 


Ser 


Glu 


Thr 


Asp 






595 




600 




605 








Ser 


Gin 


Ala Ser 


Arg Glu 


Arg Arg Leu Lys 


Lys Asn 


Leu 


Ala 


Leu 


Ser 



610 615 620 



Arg Glu Ser Leu Val Val 
625 630 

<210> 6 

<211> 1954 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> allele 
<222> 595 

<223> 9-3-324 : polymorphic base C or T 
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<220> 

<221> allele 
<222> 987 

<223> 9-7-325 polymorphic base A or G 
<220> 

<221> allele 
<222> 1158 

<223> 9-9-246 : polymorphic base G or C 
<220> 

<22l> allele 
<222> 1454 

<223> LSRX9fl3-BM : polymorphic base deletion of AGG 
<220> 

<221> allele 
<222> 1875 

<223> LSRX9fl4-BM : polymorphic base T or G 
<400> 6 

tggagtgtgg ctcggaggac cgcggcgggt caagcacctt tctcccccat atctgaaagc 60 
atgccctttg tccacgtcgt ttacgctcat taaaacttcc aga atg caa cag gac 115 

Met Gin Gin Asp 

1 



gga 


ctt 


gga 


gta 


ggg 


aca 


agg 


aac 


gga 


agt 


ggg 


aag 


ggg 


agg 


age 


gtg 


163 


Gly 


Leu 


Gly 


Val 


Gly 


Thr 


Arg 


Asn 


Gly 


Ser 


Gly 


Lys 


Gly 


Arg 


Ser 


Val 




5 










10 










15 










20 




cac 


ccc 


tec 


tgg 


cct 


tgg 


tgc 


gcg 


ccg 


cgc 


ccc 


eta 


agg 


tac 


ttt 


gga 


211 


His 


Pro 


Ser 


Trp 


Pro 


Trp 


Cys 


Ala 


Pro 


Arg 


Pro 


Leu 


Arg 


Tyr 


Phe 


Gly 












25 










30 










35 






agg 


gac 


gcg 


egg 


gcc 


aga 


cgc 


gcc 


cag 


acg 


gcc 


gcg 


atg 


gcg 


ctg 


ttg 


259 


Arg 


Asp 


Ala 


Arg 


Ala 


Arg 


Arg 


Ala 


Gin 


Thr 


Ala 


Ala 


Met 


Ala 


Leu 


Leu 










40 










45 










50 








gcc 


ggc 


ggg 


etc 


tec 


aga 


ggg 


ctg 


ggc 


tec 


cac 


ccg 


gcc 


gcc 


gca 


ggc 


307 


Ala 


Gly 


Gly 


Leu 


Ser 


Arg 


Gly 


Leu 


Gly 


Ser 


His 


Pro 


Ala 


Ala 


Ala 


Gly 








55 










60 










65 










egg 


gac 


gcg 


gtc 


gtc 


ttc 


gtg 


tgg 


ctt 


ctg 


ctt 


age 


acc 


tgg 


tgc 


aca 


355 


Arg 


Asp 


Ala 


Val 


Val 


Phe 


Val 


Trp 


Leu 


Leu 


Leu 


Ser 


Thr 


Trp 


Cys 


Thr 






70 










75 










80 












get 


cct 


gcc 


agg 


gcc 


ate 


cag 


gtg 


acc 


gtg 


tec 


aac 


ccc 


tac 


cac 


gtg 


403 


Ala 


Pro 


Ala 


Arg 


Ala 


He 


Gin 


Val 


Thr 


val 


Ser 


Asn 


Pro 


Tyr 


His 


Val 




85 










90 










95 










100 




gtg 


ate 


etc 


ttc 


cag 


cct 


gtg 


acc 


ctg 


ccc 


tgt 


acc 


tac 


cag 


atg 


acc 


451 


val 


He 


Leu 


Phe 


Gin 


Pro 


Val 


Thr 


Leu 


Pro 


Cys 


Thr 


Tyr 


Gin 


Met 


Thr 












105 










110 










115 






teg 


acc 


ccc 


acg 


caa 


ccc 


ate 


gtc 


ate 


tgg 


aag 


tac 


aag 


tct 


ttc 


tgc 


499 


Ser 


Thr 


Pro 


Thr 


Gin 


Pro 


He 


Val 


He 


Trp 


Lys 


Tyr 


Lys 


Ser 


Phe 


Cys 










120 










125 










130 








egg 


gac 


cgc 


ate 


gcc 


gat 


gcc 


ttc 


tec 


ccg 


gcc 


age 


gtc 


gac 


aac 


cag 


547 


Arg 


Asp 


Arg 


He 


Ala 


Asp 


Ala 


Phe 


Ser 


Pro 


Ala 


Ser 


Val 


Asp 


Asn 


Gin 








135 










140 










145 










etc 


aat 


gcc 


cag 


ctg 


gca 


gcc 


ggg 


aac 


cca 


ggc 


tac 


aac 


ccc 


tac 


gty 


595 


Leu 


Asn 


Ala 


Gin 


Leu 


Ala 


Ala 


Gly 


Asn 


Pro 


Gly 


Tyr 


Asn 


Pro 


Tyr 


Val 






150 










155 










160 












gag 


tgc 


cag 


gac 


age 


gtg 


cgc 


acc 


gtc 


agg 


gtc 


gtg 


gcc 


acc 


aag 


cag 


643 


Glu 


Cys 


Gin 


Asp 


Ser 


Val 


Arg 


Thr 


Val 


Arg 


val 


val 


Ala 


Thr 


Lys 


Gin 




165 










170 










175 










180 




ggc 


aac 


get 


gtg 


acc 


ctg 


gga 


gat 


tac 


tac 


cag 


ggc 


egg 


agg 


att 


acc 


691 


Gly 


Asn 


Ala 


Val 


Thr 


Leu 


Gly 


Asp 


Tyr 


Tyr 


Gin 


Gly 


Arg 


Arg 


lie 


Thr 












185 










190 










195 






ate 


acc 


gga 


aat 


get 


gac 


ctg 


acc 


ttt 


gac 


cag 


acg 


gcg 


tgg 


ggg 


gac 


739 
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lie 


Thr 


Gly 


Asn 


Ala 


Asp 


Leu 


Thr 


Phe 


Asp 


Gin 


Thr 


Ala 


Trp 


Gly 


Asp 










200 










205 










210 








agt 


ggt 


gtg 


tat 


tac 


tgc 


tec 


gtg 


gtc 


tea 


gcc 


cag 


gac 


etc 


cag 


ggg 


787 


Ser 


Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 


Val 


Val 


Ser 


Ala 


bin 


ASp 


Leu 


Gin 


Gly 








215 










220 










*5 ^ C 










aac 


aat 


gag 


gcc 


tac 


gca 


gag 


etc 


ate 


gtc 


ctt 


gtg 


t- -i t_ 

cat 


gcc 


gcc 


ggc 


O *5 C 

835 


Asn 


Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 


Leu 


lie 


Val 


Leu 


Val 


Tyr 


Ala 


Ala 


Gly 






230 










235 










240 












aaa 


gca 


gcc 


acc 


tea 


ggt 


gtt 


ccc 


age 


att 


tat 


gcc 


ccc 


age 


acc 


tat 


883 


Lys 


Ala 


Ala 


Thr 


Ser 


Gly 


Val 


Pro 


Ser 


lie 


Tyr 


Ala 


Pro 


Ser 


Thr 


Tyr 




245 










250 










255 










260 




gcc 


cac 


ctg 


tct 


ccc 


gcc 


aag 


acc 


cca 


ccc 


cca 


cca 


get 


atg 


att 


ccc 


931 


Ala 


His 


Leu 


Ser 


Pro 


Ala 


Lys 


Thr 


Pro 


Pro 


Pro 


Pro 


Ala 


Met 


lie 


Pro 












265 










270 










275 






atg 


ggc 


cct 


gcc 


tac 


aac 


ggg 


tac 


cct 


gga 


gga 


tac 


cct 


gga 


gac 


gtt 


979 


Met 


Gly 


Pro 


Ala 


Tyr 


Asn 


Gly 


Tyr 


Pro 


Gly 


Gly 


Tyr 


Pro 


Gly 


Asp 


Val 










280 










285 










290 








gac 


agg 


art 


age 


tea 


get 


ggt 


ggc 


caa 


ggc 


tec 


tat 


gta 


CCC 


ctg 


ctt 


1027 


Asp 


Arg 


Xaa 


Ser 


Ser 


Ala 


Gly 


Gly 


Gin 


Gly 


Ser 


Tyr 


Val 


Pro 


Leu 


Leu 








295 










300 










305 










egg 


gac 


acg 


gac 


age 


agt 


gtg 


gcc 


tct 


gaa 


gtc 


cgc 


agt 


ggc 


tac 


agg 


1075 


Arg 


Asp 


Thr 


Asp 


Ser 


Ser 


Val 


Ala 


Ser 


Glu 


Val 


Arg 


Ser 


Gly 


Tyr 


Arg 






310 










315 










320 












att 


cag 


gcc 


age 


cag 


cag 


gac 


gac 


tec 


atg 


egg 


gtc 


ctg 


tac 


tac 


atg 


1123 


lie 


Gin 


Ala 


Ser 


Gin 


Gin 


Asp 


Asp 


Ser 


Met 


Arg 


Val 


Leu 


Tyr 


Tyr 


Met 




325 










330 










335 










340 




gag 


aag 


gag 


ctg 


gcc 


aac 


ttc 


gac 


cct 


tct 


cga 


est 


ggc 


ccc 


ccc 


agt 


1171 


Glu 


Lys 


Glu 


Leu 


Ala 


Asn 


Phe 


Asp 


Pro 


Ser 


Arg 


Xaa 


Gly 


Pro 


Pro 


Ser 












345 










350 










355 






ggc 


cgt 


gtg 


gag 


egg 


gcc 


atg 


agt 


gaa 


gtc 


acc 


tec 


etc 


cac 


gag 


gac 


1219 


Gly 


Arg 


Val 


Glu 


Arg 


Ala 


Met 


Ser 


Glu 


Val 


Thr 


Ser 


Leu 


His 


Glu 


Asp 










360 










365 










370 








gac 


tgg 


cga 


tct 


egg 


cct 


tec 


egg 


ggc 


cct 


gcc 


etc 


acc 


ccg 


ate 


egg 


1267 


Asp 


Trp 


Arg 


Ser 


Arg 


Pro 


Ser 


Arg 


Gly 


Pro 


Ala 


Leu 


Thr 


Pro 


He 


Arg 








375 










380 










385 










gat 


gag 


gag 


tgg 


ggt 


ggc 


cac 


tec 


ccc 


egg 


agt 


ccc 


agg 


gga 


tgg 


gac 


1315 


Asp 


Glu 


Glu 


Trp 


Gly 


Gly 


His 


Ser 


Pro 


Arg 


Ser 


Pro 


Arg 


Gly 


Trp 


Asp 






390 










395 










400 












cag 


gag 


ccc 


gcc 


agg 


gag 


cag 


gca 


ggc 


ggg 


ggc 


tgg 


egg 


gcc 


agg 


egg 


1363 


Gin 


Glu 


Pro 


Ala 


Arg 


Glu 


Gin 


Ala 


Gly 


Gly 


Gly 


Trp 


Arg 


Ala 


Arg 


Arg 




405 










410 










415 










420 




ccc 


egg 


gcc 


cgc 


tec 


gtg 


gac 


gcc 


ctg 


gac 


gac 


etc 


acc 


ccg 


ccg 


age 


1411 


Pro 


Arg 


Ala 


Arg 


Ser 


Val 


Asp 


Ala 


Leu 


Asp 


Asp 


Leu 


Thr 


Pro 


Pro 


Ser 












425 










430 










435 






acc 


gcc 


gag 


tea 


ggg 


age 


agg 


tct 


ccc 


acg 


agt 


aat 


ggt 


ggg 


aga 


age 


1459 


Thr 


Ala 


Glu 


Ser 


Gly 


Ser 


Arg 


Ser 


Pro 


Thr 


Ser 


Asn 


Gly 


Gly 


Arg 


Ser 










440 










445 










450 








egg 


gcc 


tac 


atg 


ccc 


ccg 


egg 


age 


cgc 


age 


egg 


gac 


gac 


etc 


tat 


gac 


1507 


Arg 


Ala 


Tyr 


Met 


Pro 


Pro 


Arg 


Ser 


Arg 


Ser 


Arg 


Asp 


Asp 


Leu 


Tyr 


Asp 








455 










460 










465 










caa 


gac 


gac 


teg 


agg 


gac 


ttc 


cea 


cgc 


tec 


egg 


gac 


ccc 


cac 


tac 


gac 


1555 


Gin 


Asp 


Asp 


Ser 


Arg 


Asp 


Phe 


Pro 


Arg 


Ser 


Arg 


Asp 


Pro 


His 


Tyr 


Asp 






470 










475 










480 












gac 


ttc 


agg 


tct 


egg 


gag 


cgc 


cct 


cct 


gcc 


gac 


ccc 


agg 


tec 


cac 


cac 


1603 


Asp 


Phe 


Arg 


Ser 


Arg 


Glu 


Arg 


Pro 


Pro 


Ala 


Asp 


Pro 


Arg 


Ser 


His 


His 




485 










490 










495 










500 




cac 


cgt 


acc 


egg 


gac 


cct 


egg 


gac 


aac 


ggc 


tec 


agg 


tec 


ggg 


gac 


etc 


1651 


His 


Arg 


Thr 


Arg 


Asp 


Pro 


Arg 


Asp 


Asn 


Gly 


Ser 


Arg 


Ser 


Gly 


Asp 


Leu 












505 










510 










515 






ccc 


tat 


gat 


ggg 


egg 


eta 


ctg 


gag 


gag 


get 


gtg 


agg 


aag 


aag 


ggg 


teg 


1699 


Pro 


Tyr 


Asp 


Gly 


Arg 


Leu 


Leu 


Glu 


Glu 


Ala 


Val 


Arg 


Lys 


Lys 


Gly 


Ser 










520 










525 










530 








gag 


gag 


agg 


agg 


aga 


ccc 


cac 


aag 


gag 


gag 


gag 


gaa 


gag 


gcc 


tac 


tac 


1747 
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Glu Glu Arg Arg Arg Pro His Lys Glu Glu Glu Glu Glu Ala Tyr Tyr 

535 540 545 

ccg ccc gcg ccg ccc ccg tac teg gag acc gac teg cag gcg tec cga 1795 
Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gin Ala Ser Arg 

550 555 560 

gag cgc agg etc aag aag aac ttg gee ctg agt egg gaa agt tta gtc 1843 
Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu Val 
565 570 575 580 

gtc tga tctgacgttt tetaegtage ttttgkattt ttttttttaa tttgaaggaa 1899 
Val * 

cactgatgaa gccctgccat acccctcccg agtctaataa aacgtataat cacaa 1954 

<210> 7 
<211> 581 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> VARIANT 

<222> 295 

<223> 9-7-325 : polymorphic amino acid Ser or Asn 
<220> 

<221> VARIANT 

<222> 352 

<223> 9-9-246 : polymorphic amino acid Pro or Arg 
<220> 
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<400> 7 



Met 
1 


Gin 


Gin 


Asp 


Gly 


Arg 


Ser 


Val 








20 


Arg 


Tyr 


Phe 


Gly 






35 




Met 


Ala 


Leu 


Leu 




50 






Ala 


Ala 


Ala 


Gly 


65 








Thr 


Trp 


Cys 


Thr 


Pro 


Tyr 


His 


Val 








100 


Tyr 


Gin 


Met 


Thr 






115 




Lys 


Ser 


Phe 


Cys 




130 






Val 


Asp 


Asn 


Gin 


145 








Asn 


Pro 


Tyr 


Val 


Ala 


Thr 


• 

Lys 


Gin 








180 


Arg 


Arg 


He 


Thr 






195 




Ala 


Trp 


Gly 


Asp 




210 






Asp 


Leu 


Gin 


Gly 



225 



Gly 


Leu 


Gly 


Val 


5 








His 


Pro 


Ser 


Trp 


Arg 


Asp 


Ala 


Arg 








40 


Ala 


Gly 


Gly 


Leu 






55 




Arg 


Asp 


Ala 


val 




70 






Ala 


Pro 


Ala 


Arg 


85 








Val 


He 


Leu 


Phe 


Ser 


Thr 


Pro 


Thr 








120 


Arg 


Asp 


Arg 


He 






135 




Leu 


Asn 


Ala 


Gin 




150 






Glu 


Cys 


Gin 


Asp 


165 








Gly 


Asn 


Ala 


Val 


He 


Thr 


Gly 


Asn 








200 


Ser 


Gly 


Val 


Tyr 






215 




Asn 


Asn 


Glu 


Ala 




230 







Gly 


Thr 


Arg 


Asn 




10 






Pro 


Trp 


Cys 


Ala 


25 








Ala 


Arg 


Arg 


Ala 


Ser 


Arg 


Gly 


Leu 








60 


Val 


Phe 


Val 


Trp 






75 




Ala 


lie 


Gin 


Val 




90 






Gin 


Pro 


Val 


Thr 


105 








Gin 


Pro 


He 


Val 


Ala 


Asp 


Ala 


Phe 








140 


Leu 


Ala 


Ala 


Gly 






155 




Ser 


Val 


Arg 


Thr 




170 






Thr 


Leu 


Gly 


Asp 


185 








Ala 


Asp 


Leu 


Thr 


Tyr 


Cys 


Ser 


val 








220 


Tyr 


Ala 


Glu 


Leu 






235 





Gly 


Ser 


Gly 


Lys 






15 




Pro 


Arg 


Pro 


Leu 




30 






Gin 


Thr 


Ala 


Ala 


45 








Gly 


Ser 


His 


Pro 


Leu 


Leu 


Leu 


Ser 








80 


Thr 


Val 


Ser 


Asn 






95 




Leu 


Pro 


Cys 


Thr 




110 






He 


Trp 


Lys 


Tyr 


125 








Ser 


Pro 


Ala 


Ser 


Asn 


Pro 


Gly 


Tyr 








160 


Val 


Arg 


Val 


Val 






175 




Tyr 


Tyr 


Gin 


Gly 




190 






Phe 


Asp 


Gin 


Thr 


205 








Val 


Ser 


Ala 


Gin 


He 


Val 


Leu 


val 



240 
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Tyr Ala 


Ala 


Gly 


Lys 


TV 1 - 

Aia 








245 




Pro Ser 


Thr 


Tyr 


Ala 


His 






260 






Ala Met 


lie 


Pro 


Met 


Gly 




275 








Pro Gly 


Asp 


Val 


Asp 


Arg 


290 










Val Pro 


Leu 


Leu 


Arg 


Asp 


305 








310 


Ser Gly 


Tyr 


Arg 


lie 


Gin 








325 




Leu Tyr 


Tyr 


Met 


Glu 


Lys 






340 






Gly Pro 


Pro 


Ser 


Gly 


Arg 




355 








Leu His 


Glu 


Asp 


Asp 


Trp 


370 










Thr Pro 


lie 


Arg 


Asp 


Glu 


385 








390 


Arg Gly 


Trp 


Asp 


Gin 


Glu 








405 




Arg Ala 


Arg 


Arg 


Pro 


Arg 






420 






Thr Pro 


Pro 


Ser 


Thr 


Ala 




435 








Gly Gly 


Arg 


Ser 


Arg 


Ala 


450 










Asp Leu 


Tyr 


Asp 


Gin 


Asp 


465 








470 


Pro His 


Tyr 


Asp 


Asp 


Phe 








485 




Arg Ser 


His 


His 


His 


Arg 






500 






Ser Gly 


Asp 


Leu 


Pro 


Tyr 




515 








Lys Lys 


Gly 


Ser 


Glu 


Glu 


530 










Glu Ala 


Tyr 


Tyr 


Pro 


Pro 


545 








550 


Gin Ala 


Ser 


Arg 


Glu 


Arg 








565 




Glu Ser 


Leu 


Val 


Val 








580 
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Ala 


Thr 


Ser 


Gly 


Val 


Pro 








250 






Leu 


Ser 


Pro 


Ala 


Lys 


Thr 






265 








Pro 


Ala 


Tyr 


Asn 


Gly 


Tyr 




280 










Ser 


Ser 


Ser 


Ala 


Gly 


Gly 


295 










300 


Thr 


Asp 


Ser 


Ser 


Val 


Ala 










315 




Ala 


Ser 


Gin 


Gin 


Asp 


Asp 








330 






Glu 


Leu 


Ala 


Asn 


Phe 


Asp 






345 








Val 


Glu 


Arg 


Ala 


Met 


Ser 




360 










Arg 


Ser 


Arg 


Pro 


Ser 


Arg 


375 










380 


Glu 


Trp 


Gly 


Gly 


His 


Ser 










395 




Pro 


Ala 


Arg 


Glu 


Gin 


Ala 








410 






Ala 


Arg 


Ser 


Val 


Asp 


Ala 






425 








Glu 


Ser 


Gly 


Ser 


Arg 


Ser 




440 










Tyr 


Met 


Pro 


Pro 


Arg 


Ser 


455 










460 


Asp 


Ser 


Arg 


Asp 


Phe 


Pro 










475 




Arg 


Ser 


Arg 


Glu 


Arg 


Pro 








490 






Thr 


Arg 


Asp 


Pro 


Arg 


Asp 






505 








Asp 


Gly 


Arg 


Leu 


Leu 


Glu 




520 










Arg 


Arg 


Arg 


Pro 


His 


Lys 


535 










540 


Ala 


Pro 


Pro 


Pro 


Tyr 


Ser 










555 




Arg 


Leu 


Lys 


Lys 


Asn 


Leu 



570 



PCT/IBOO/01470 



Ser 


He 


Tyr 


Ala 






255 




Pro 


Pro 


Pro 


Pro 




270 






Pro 


Gly 


Gly 


Tyr 


285 








Gin 


Gly 


Ser 


Tyr 


Ser 


Glu 


Val 


Arg 








320 


Ser 


Met 


Arg 


Val 






335 




Pro 


Ser 


Arg 


Pro 




350 






Glu 


Val 


Thr 


Ser 


365 








Gly 


Pro 


Ala 


Leu 


Pro 


Arg 


Ser 


Pro 








400 


Gly 


Gly 


Gly 


Trp 






415 




Leu 


Asp 


Asp 


Leu 




430 






Pro 


Thr 


Ser 


Asn 


445 








Arg 


Ser 


Arg 


Asp 


Arg 


Ser 


Arg 


Asp 








480 


Pro 


Ala 


Asp 


Pro 






495 




Asn 


Gly 


Ser 


Arg 




510 






Glu 


Ala 


Val 


Arg 


525 








Glu 


Glu 


Glu 


Glu 


Glu 


Thr 


Asp 


Ser 








560 


Ala 


Leu 


Ser 


Arg 






575 
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accgctcacc aggtcagttg tccccggaaa gccgaaggca tgagcttcgc ccaagttctt 60 
tttatgggtt agaactcctc cagagcgggg gaaaaaggac ttggaatagg ggcgggacgg 12 0 
agcacgcacc cttctccgcc ttggttctcg ccgcgccccc tactctcggg atacttggga 180 
ggggacgcgc gggcaccgtc gctgctagac ggccgcg atg gcg ccg gcg gcc ggc 235 

Met Ala Pro Ala Ala Gly 
1 5 

gcg tgt get ggg gcg cct gac tec cac cca get acc gtg gtc ttc gtg 283 
Ala Cys Ala Gly Ala Pro Asp Ser His Pro Ala Thr Val Val Phe Val 

10 15 20 

tgt etc ttt etc ate att ttc tgc cca gac cct gcc agt gcc ate cag 331 
Cys Leu Phe Leu He He Phe Cys Pro Asp Pro Ala Ser Ala He Gin 
25 30 35 
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gtg act 


gtg 


tct 


gac 


ccc 


tac 


cac 


gta 


gtg 


ate 


ctg 


ttc 


cag 


cca 


gtg 


379 


Val 


Thr 
40 


Val 


Ser 


Asp 


Pro 


Tyr 
45 


His 


Val 


Val 


He 


Leu 
50 


Phe 


Gin 


Pro 


Val 




acc 


ctg 


ccc 


tgc 


acc 


tat 


cag 


atg 


age 


aac 


act 


etc 


aca 


gtc 


ccc 


ate 


427 


Thr 


Leu 


Pro 


Cys 


Thr 


Tyr 


Gin 


Met 


Ser 


Asn 


Thr 


Leu 


Thr 


Val 


Pro 


He 




55 










60 










65 










70 




gtg 


ate 


tgg 


aag 


tac 


aag 


tea 


ttc 


tgc 


egg 


gac 


cgt 


att 


gee 


gat 


gec 


475 


Val 


lie 


Trp 


Lys 


Tyr 
75 


Lys 


Ser 


Phe 


Cys 


Arg 
80 


Asp 


Arg 


He 


Ala 


Asp 
85 


Ala 




ttc 


tct 


cct 


gee 


agt 


gtg 


gac 


aac 


cag 


eta 


aat 


gec 


cag 


ttg 


gca 


get 


523 


Phe 


Ser 


Pro 


Ala 
90 


Ser 


Val 


Asp 


Asn 


Gin 
95 


Leu 


Asn 


Ala 


Gin 


Leu 
100 


Ala 


Ala 




ggc 


aac 


ccc 


ggc 


tac 


aac 


ccc 


tat 


gtg 


gag 


tgc 


cag 


gac 


agt 


gta 


cgc 


571 


Gly Asn 


Pro 


Gly 


Tyr 


Asn 


Pro 


Tyr 


Val 


Glu 


Cys 


Gin 


Asp 


Ser 


Val 


Arg 








105 










110 










115 










act gtc 


agg 


gtg 


gtg 


gee 


acc 


aaa 


cag 


ggc 


aat 


gcg 


gtg 


acc 


ctg 


gga 


619 


Thr 


Val 
120 


Arg 


Val 


Val 


Ala 


Thr 
125 


Lys 


Gin 


Gly 


Asn 


Ala 
130 


Val 


Thr 


Leu 


Gly 




gac 


tac 


tac 


caa 


ggc 


agg 


agg 


ate 


acc 


ata 


aca 


gga 


aat 


get 


gac 


ctg 


667 


Asp Tyr 


Tyr 


Gin 


Gly 


Arg 


Arg 


He 


Thr 


He 


Thr 


Gly 


Asn 


Ala 


Asp 


Leu 




135 










140 










145 










150 




acc 


ttc 


gag 


cag 


aca 


gee 


tgg 


gga 


gac 


agt 


gga 


gtg 


tat 


tac 


tgc 


tct 


715 


Thr 


Phe 


Glu 


Gin 


Thr 
155 


Ala 


Trp 


Gly 


Asp 


Ser 
160 


Gly 


Val 


Tyr 


Tyr 


Cys 
165 


Ser 




gtg 


gtc 


teg 


gee 


caa 


gat 


ctg 


gat 


gga 


aac 


aac 


gag 


gcg 


tac 


gca 


gag 


763 


Val 


Val 


Ser 


Ala 
170 


Gin 


Asp 


Leu 


Asp 


Gly 
175 


Asn 


Asn 


Glu 


Ala 


Tyr 
180 


Ala 


Glu 




etc 


ate 


gtc 


ctt 


ggc 


agg 


acc 


tea 


gag 


gee 


cct 


gag 


etc 


eta 


cct 


ggt 


811 


Leu 


He 


Val 
185 


Leu 


Gly 


Arg 


Thr 


Ser 
190 


Glu 


Ala 


Pro 


Glu 


Leu 
195 


Leu 


Pro 


Gly 




ttt 


egg 


gcg 


ggg 


ccc 


ttg 


gaa 


gat 


tgg 


etc 


ttt 


gtg 


gtc 


gtg 


gtc 


tgc 


859 


Phe Arg 


Ala 


Gly 


Pro 


Leu 


Glu 


Asp 


Trp 


Leu 


Phe 


Val 


Val 


Val 


Val 


Cys 






200 










205 
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ctg 


gcg 


age 


etc 


etc 


etc 


ttc 


etc 


etc 


ctg 


ggc 


ate 


tgc 


tgg 


tgc 


cag 


907 


Leu 


Ala 


Ser 


Leu 


Leu 


Leu 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Cys 


Trp 


Cys 


Gin 




215 










220 










225 










230 




tgc 


tgt 


cct 


cac 


acc 


tgc 


tgc 


tgc 


tat 


gtc 


cga 


tgt 


ccc 


tgc 


tgc 


cca 


955 


Cys 


Cys 


Pro 


His 


Thr 
235 


Cys 


Cys 


Cys 


Tyr 


val 

240 


Arg 


Cys 


Pro 


Cys 


Cys 
245 


Pro 




gac 


aag 


tgc 


tgt 


tgc 


cct 


gag 


get 


ctt 


tat 


get 


get 


ggc 


aaa 


gca 


gec 


1003 


Asp Lys 


Cys 


Cys 


Cys 


Pro 


Glu 


Ala 


Leu 


Tyr 


Ala 


Ala 


Gly 


Lys 


Ala 


Ala 










250 










255 










260 








acc 


tea 


ggt 


gtc 


ccg 


age 


ate 


tat 


gee 


ccc 


age 


ate 


tat 


acc 


cac 


etc 


1051 


Thr 


Ser 


Gly 
265 


Val 


Pro 


Ser 


He 


Tyr 
270 


Ala 


Pro 


Ser 


He 


Tyr 
275 


Thr 


His 


Leu 




tea 


cct 


gec 


aag 


acc 


cca 


cca 


cct 


ccg 


cct 


gee 


atg 


att 


ccc 


atg 


ggc 


1099 


Ser 


Pro 
280 


Ala 


Lys 


Thr 


Pro 


Pro 
285 


Pro 


Pro 


Pro 


Ala 


Met 
290 


He 


Pro 


Met 


Gly 




cct 


ccc 


tat 


ggg 


tac 


cct 


gga 


gac 


ttt 


gac 


aga 


cat 


age 


tea 


gtt 


ggt 


1147 


Pro 


Pro 


Tyr 


Gly 


Tyr 


Pro 


Gly 


Asp 


Phe 


Asp 


Arg 


His 


Ser 


Ser 


Val 


Gly 




295 










300 










305 










310 




ggc 


cac 


age 


tec 


caa 


gta 


ccc 


ctg 


ctg 


cgt 


gac 


gtg 


gat 


ggc 


agt 


gta 


1195 


Gly His 


Ser 


Ser 


Gin 


Val 


Pro 


Leu 


Leu 


Arg 


Asp 


Val 


Asp 


Gly 


Ser 


Val 












315 










320 










325 






tct 


tea 


gaa 


gta 


cga 


agt 


ggc 


tac 


agg 


ate 


cag 


get 


aac 


cag 


caa 


gat 


1243 


Ser 


Ser 


Glu 


Val 
330 


Arg 


Ser 


Gly 


Tyr 


Arg 
335 


He 


Gin 


Ala 


Asn 


Gin 
340 


Gin 


Asp 




gac 


tec 


atg 


agg 


gtc 


eta 


tac 


tat 


atg 


gag 


aaa 


gag 


eta 


gec 


aac 


ttt 


1291 


Asp Ser 


Met 


Arg 


Val 


Leu 


Tyr 


Tyr 


Met 


Glu 


Lys 


Glu 


Leu 


Ala 


Asn 


Phe 








345 










350 










355 










gac 


cct 


tec 


cga 


cct 


ggc 


cct 


ccc 


aat 


ggc 


aga 


gtg 


gaa 


egg 


gec 


atg 


1339 


Asp 


Pro 
360 


Ser 


Arg 


Pro 


Gly 


Pro 
365 


Pro 


Asn 


Gly 


Arg 


Val 
370 


Glu 


Arg 


Ala 


Met 
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agt 


gaa 


gta 


ace 


tec 


etc 


cat 


gaa 


gat 


gac 


tgg 


cga 


teg 


agg 


cct 


tec 


1387 


Ser 


Glu 


Val 


Thr 


Ser 


Leu 


His 


Glu 


Asp 


Asp 


Trp 


Arg 


Ser 


Arg 


Pro 


Ser 




375 










380 










385 










390 




agg 


get 


cct 


gee 


etc 


acc 


ccc 


ate 


agg 


gat 


gag 


gag 


tgg 


aat 


cgc 


cac 


1435 


Arg Ala 


Pro 


Ala 


Leu 


Thr 


Pro 


He 


Arg 


Asp 


Glu 


Glu 


Trp 


Asn 


Arg 


His 












395 










400 










405 






tec 


cca 


cag 


agt 


CCC 


aga 


aca 


tgg 


gag 


cag 


gaa 


ccc 


ctt 


caa 


gaa 


caa 


1483 


Ser 


Pro 


Gin 


Ser 
410 


Pro 


Arg 


Thr 


Trp 


Glu 
415 


Gin 


Glu 


Pro 


Leu 


Gin 
420 


Glu 


Gin 




cca 


agg 


ggt 


ggt 


tgg 


ggg 


tct 


gga 


cgc 


cct 


egg 


gec 


cgc 


tct 


gtg 


gat 


1531 


Pro Arg 


Gly 


Gly 


Trp 


Gly 


Ser 


Gly 


Arg 


Pro 


Arg 


Ala 


Arg 


Ser 


Val 


Asp 








425 










430 










435 










get 


eta 


gat 


gat 


ate 


aac 


egg 


cct 


ggc 


tec 


act 


gaa 


tea 


gga 


egg 


tct 


1579 


Ala 


Leu 
440 


Asp 


Asp 


He 


Asn 


Arg 
445 


Pro 


Gly 


Ser 


Thr 


Glu 
450 


Ser 


Gly 


Arg 


Ser 




tct 


ccc 


cca 


agt 


agt 


gga 


egg 


aga 


gga 


egg 


gec 


tat 


gca 


cct 


cca 


aga 


1627 


Ser 


Pro 


Pro 


Ser 


Ser 


Gly 


Arg 


Arg 


Gly 


Arg 


Ala 


Tyr 


Ala 


Pro 


Pro 


Arg 




455 










460 










465 










470 




agt 


cgc 


age 


egg 


gat 


gac 


etc 


tat 


gac 


ccg 


gac 


gat 


cct 


agg 


gac 


ttg 


1675 


Ser 


Arg 


Ser 


Arg 


Asp 

475 


Asp 


Leu 


Tyr 


Asp 


Pro 
480 


Asp 


Asp 


Pro 


Arg 


Asp 
485 


Leu 




cca 


cat 


tec 


cga 


gat 


ccc 


cac 


tat 


tat 


gac 


gac 


ate 


agg 


tct 


aga 


gat 


1723 


Pro 


His 


Ser 


Arg 
490 


Asp 


Pro 


His 


Tyr 


Tyr 
495 


Asp 


Asp 


He 


Arg 


Ser 
500 


Arg 


Asp 




cca 


cgt 


get 


gac 


ccc 


aga 


tec 


cgt 


cag 


cga 


tec 


cga 


gat 


cct 


egg 


gat 


1771 


Pro 


Arg 


Ala 
505 


Asp 


Pro 


Arg 


Ser 


Arg 
510 


Gin 


Arg 


Ser 


Arg 


Asp 
515 


Pro 


Arg 


Asp 




get 


ggc 


ttc 


agg 


tea 


agg 


gac 


cct 


cag 


tat 


gat 


ggg 


cga 


eta 


tta 


gaa 


1819 


Ala Gly 


Phe 


Arg 


Ser 


Arg 


Asp 


Pro 


Gin 


Tyr 


Asp 


Gly 


Arg 


Leu 


Leu 


Glu 






520 










525 










530 












gag 


get 


tta 


aag 


aaa 


aag 


ggg 


teg 


ggc 


gag 


aga 


agg 


agg 


gtt 


tac 


agg 


1867 


Glu 


Ala 


Leu 


Lys 


Lys 


Lys 


Gly 


Ser 


Gly 


Glu 


Arg 


Arg 


Arg 


Val 


Tyr 


Arg 




535 










540 










545 










550 




gag 


gaa 


gaa 


gag 


gaa 


gag 


gag 


ggc 


caa 


tac 


ccc 


cca 


gca 


cct 


cca 


cct 


1915 


Glu 


Glu 


Glu 


Glu 


Glu 
555 


Glu 


Glu 


Gly 


Gin 


Tyr 
560 


Pro 


Pro 


Ala 


Pro 


Pro 
565 


Pro 




tac 


tea 


gag 


act 


gac 


teg 


cag 


gee 


tea 


egg 


gag 


agg 


agg 


ctg 


aaa 


aag 


1963 


Tyr 


Ser 


Glu 


Thr 
570 


Asp 


Ser 


Gin 


Ala 


Ser 
575 


Arg 


Glu 


Arg 


Arg 


Leu 
580 


Lys 


Lys 




aat 


ttg 


gee 


ctg 


agt 


egg 


gaa 


agt 


tta 


gtc 


gtc 


tga 


tccacgtttt 




2009 


Asn 


Leu 


Ala 


Leu 


Ser 


Arg 


Glu 


Ser 


Leu 


Val 


Val 


* 
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Met 


Ala 


Pro 


Ala 


Ala 


Gly 


Ala 


Cys 


Ala 


Gly 


Ala 


Pro 


Asp 


Ser 


His 


Pro 


1 








5 










10 










15 




Ala 


Thr 


Val 


Val 


Phe 


Val 


Cys 


Leu 


Phe 


Leu 


He 


He 


Phe 


Cys 


Pro 


Asp 








20 










25 










30 






Pro 


Ala 


Ser 


Ala 


He 


Gin 


Val 


Thr 


Val 


Ser 


Asp 


Pro 


Tyr 


His 


Val 


Val 






35 










40 










45 








He 


Leu 


Phe 


Gin 


Pro 


Val 


Thr 


Leu 


Pro 


Cys 


Thr 


Tyr 


Gin 


Met 


Ser 


Asn 




50 










55 










60 










Thr 


Leu 


Thr 


Val 


Pro 


He 


Val 


He 


Trp 


Lys 


Tyr 


Lys 


Ser 


Phe 


Cys 


Arg 


65 










70 










75 










80 


Asp Arg 


He 


Ala 


Asp 


Ala 


Phe 


Ser 


Pro 


Ala 


Ser 


Val 


Asp 


Asn 


Gin 


Leu 










85 










90 










95 
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Asn 


Ala 


Gin 


Leu 


Ala 


Ala 








100 






Cys 


Gin 


Asp 


Ser 


Val 


Arg 






115 








Asn 


Ala 


Val 


Thr 


Leu 


Gly 




130 










Thr 


Gly 


Asn 


Ala 


Asp 


Leu 


145 










150 


Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 










165 




Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 








180 






Pro 


Glu 


Leu 


Leu 


Pro 


Gly 






195 








Phe 


Val 


Val 


Val 


Val 


Cys 




210 










Gly 


lie 


Cys 


Trp 


Cys 


Gin 


225 










230 


Arg 


Cys 


Pro 


Cys 


Cys 


Pro 










245 




Ala 


Ala 


Gly 


Lys 


Ala 


Ala 








260 






Ser 


He 


Tyr 


Thr 


His 


Leu 






275 








Ala 


Met 


He 


Pro 


Met 


Gly 




290 










Arg 


His 


Ser 


Ser 


Val 


Gly 


305 










310 


Asp 


Val 


Asp 


Gly 


Ser 


Val 










325 




Gin 


Ala 


Asn 


Gin 


Gin 


Asp 








340 






Lys 


Glu 


Leu 


Ala 


Asn 


Phe 






355 








Arg 


Val 


Glu 


Arg 


Ala 


Met 




370 










Trp 


Arg 


Ser 


Arg 


Pro 


Ser 


385 










390 


Glu 


Glu 


Trp 


Asn 


Arg 


His 










405 




Glu 


Pro 


Leu 


Gin 


Glu 


Gin 








420 






Arg 


Ala 


Arg 


Ser 


Val 


Asp 






435 








Thr 


Glu 


Ser 


Gly 


Arg 


Ser 




450 










Ala 


Tyr 


Ala 


Pro 


Pro 


Arg 


465 










470 


Asp 


Asp 


Pro 


Arg 


Asp 


Leu 










485 




Asp 


He 


Arg 


Ser 


Arg 


Asp 








500 






Ser 


Arg 


Asp 


Pro 


Arg 


Asp 






515 








Asp 


Gly 


Arg 


Leu 


Leu 


Glu 




530 










Arg 


Arg 


Arg 


Val 


Tyr 


Arg 


545 










550 


Pro 


Pro 


Ala 


Pro 


Pro 


Pro 










565 




Glu 


Arg 


Arg 


Leu 


Lys 


Lys 



580 



Val 



39 



Gly 


Asn 


Pro 


Gly 


Tyr 


Asn 






105 








Thr 


Val 


Arg 


Val 


Val 


Ala 




120 










Asp 


Tyr 


Tyr 


Gin 


Gly 


Arg 


135 










140 


Thr 


Phe 


Glu 


Gin 


Thr 


Ala 










155 




Val 


Val 


Ser 


Ala 


Gin 


Asp 








170 






Leu 


He 


Val 


Leu 


Gly 


Arg 






185 








Phe 


Arg 


Ala 


Gly 


Pro 


Leu 




200 










Leu 


Ala 


Ser 


Leu 


Leu 


Leu 


215 










220 


Cys 


Cys 


Pro 


His 


Thr 


Cys 










235 




Asp 


Lys 


Cys 


Cys 


Cys 


Pro 








250 






Thr 


Ser 


Gly 


Val 


Pro 


Ser 






265 








Ser 


Pro 


Ala 


Lys 


Thr 


Pro 




280 










Pro 


Pro 


Tyr 


Gly 


Tyr 


Pro 


295 










300 


Gly 


His 


Ser 


Ser 


Gin 


Val 










315 




Ser 


Ser 


Glu 


Val 


Arg 


Ser 








330 






Asp 


Ser 


Met 


Arg 


Val 


Leu 






345 








Asp 


Pro 


Ser 


Arg 


Pro 


Gly 




360 










Ser 


Glu 


Val 


Thr 


Ser 


Leu 


375 










380 


Arg 


Ala 


Pro 


Ala 


Leu 


Thr 










395 




Ser 


Pro 


Gin 


Ser 


Pro 


Arg 








410 






Pro 


Arg 


Gly 


Gly 


Trp 


Gly 






425 








Ala 


Leu 


Asp 


Asp 


He 


Asn 




440 










Ser 


Pro 


Pro 


Ser 


Ser 


Gly 


455 










460 


Ser 


Arg 


Ser 


Arg 


Asp 


Asp 










475 




Pro 


His 


Ser 


Arg 


Asp 


Pro 








490 






Pro 


Arg 


Ala 


Asp 


Pro 


Arg 






505 








Ala 


Gly 


Phe 


Arg 


Ser 


Arg 




520 










Glu 


Ala 


Leu 


Lys 


Lys 


Lys 


535 










540 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 










555 




Tyr 


Ser 


Glu 


Thr 


Asp 


Ser 








570 






Asn 


Leu 


Ala 


Leu 


Ser 


Arg 



585 
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Pro 


Tyr 


Val 


Glu 




110 






Thr 


Lys 


Gin 


Gly 


125 








Arg 


He 


Thr 


He 


Trp 


Gly 


Asp 


Ser 








160 


Leu 


Asd 


Gly 


Asn 






175 




Thr 


Ser 


Glu 


Ala 




190 






Glu 


Asp 


Trp 


Leu 


205 








Phe 


Leu 


Leu 


Leu 


Cys 


Cys 


Tyr 


Val 








240 


Glu 


Ala 


Leu 


Tyr 






255 




lie 


Tyr 


Ala 


Pro 




270 






Pro 


Pro 


Pro 


Pro 


285 








Gly 


Asp 


Phe 


Asp 


Pro 


Leu 


Leu 


Arg 








320 


Gly 


Tvr 


Arq 


He 






335 




Tyr 


Tyr 


Met 


Glu 




350 






Pro 


Pro 


Asn 


Gly 


365 








His 


Glu 


Asp 


Asp 


Pro 


He 


Arg 


Asp 








400 


Thr 


Trp 


Glu 


Gin 






415 




Ser 


Gly 


Arg 


Pro 




430 






Arg 


Pro 


Gly 


Ser 


445 








Arg 


Arg 


Gly 


Arg 


Leu 


Tyr 


Asp 


Pro 








480 


His 


Tyr 


Tyr 


Asp 






495 




Ser 


Arg 


Gin 


Arg 




510 






Asp 


Pro 


Gin 


Tyr 


525 








Gly 


Ser 


Gly 


Glu 


Glu 


Gly 


Gin 


Tyr 








560 


Gin 


Ala 


Ser 


Arg 






575 




Glu 


Ser 


Leu 


Val 




590 
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<210> 10 
<211> 2040 
<212> DNA 

<213> Rattus norvegicus 



<400> 10 

accgctcacc aggtcagttg tccccggaaa gccgaaggca tgagcttcgc ccaagttctt 60 
tttatgggtt agaactcctc cagagcgggg gaaaaaggac ttggaatagg ggcgggacgg 120 
agcacgcacc cttctccgcc ttggttctcg ccgcgccccc tactctcggg atacttggga 180 
ggggacgcgc gggcaccgtc gctgctagac ggccgcg atg gcg ccg gcg gcc ggc 235 

Met Ala Pro Ala Ala Gly 
1 5 

gcg tgt get ggg gcg cct gac tec cac cca get acc gtg gtc ttc gtg 283 
Ala Cys Ala Gly Ala Pro Asp Ser His Pro Ala Thr Val Val Phe Val 

10 15 20 

tgt etc ttt etc ate att ttc tgc cca gac cct gcc agt gcc ate cag 331 
Cys Leu Phe Leu He He Phe Cys Pro Asp Pro Ala Ser Ala He Gin 

25 30 35 

gtg act gtg tct gac ccc tac cac gta gtg ate ctg ttc cag cca gtg 379 
Val Thr Val Ser Asp Pro Tyr His Val Val He Leu Phe Gin Pro Val 

40 45 50 

acc ctg ccc tgc acc tat cag atg age aac act etc aca gtc ccc ate 427 
Thr Leu Pro Cys Thr Tyr Gin Met Ser Asn Thr Leu Thr Val Pro He 
55 60 65 70 

gtg ate tgg aag tac aag tea ttc tgc egg gac cgt att gcc gat gcc 475 
Val He Trp Lys Tyr Lys Ser Phe Cys Arg Asp Arg He Ala Asp Ala 

75 80 85 

ttc tct cct gcc agt gtg gac aac cag eta aat gcc cag ttg gca get 523 
Phe Ser Pro Ala Ser Val Asp Asn Gin Leu Asn Ala Gin Leu Ala Ala 

90 95 100 

ggc aac ccc ggc tac aac ccc tat gtg gag tgc cag gac agt gta cgc 571 
Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu Cys Gin Asp Ser Val Arg 

105 110 115 

act gtc agg gtg gtg gcc acc aaa cag ggc aat gcg gtg acc ctg gga 619 
Thr Val Arg Val Val Ala Thr Lys Gin Gly Asn Ala Val Thr Leu Gly 

120 125 130 

gac tac tac caa ggc agg agg ate acc ata aca gga aat get gac ctg 667 
Asp Tyr Tyr Gin Gly Arg Arg He Thr lie Thr Gly Asn Ala Asp Leu 
135 140 145 150 

acc ttc gag cag aca gcc tgg gga gac agt gga gtg tat tac tgc tct 715 
Thr Phe Glu Gin Thr Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser 

155 160 165 

gtg gtc teg gcc caa gat ctg gat gga aac aac gag gcg tac gca gag 763 
Val Val Ser Ala Gin Asp Leu Asp Gly Asn Asn Glu Ala Tyr Ala Glu 

170 175 180 

etc ate gtc ctt gat tgg etc ttt gtg gtc gtg gtc tgc ctg gcg age 811 
Leu lie Val Leu Asp Trp Leu Phe Val Val Val Val Cys Leu Ala Ser 

185 190 195 

etc etc etc ttc etc etc ctg ggc ate tgc tgg tgc cag tgc tgt cct 859 
Leu Leu Leu Phe Leu Leu Leu Gly lie Cys Trp Cys Gin Cys Cys Pro 

200 205 210 

cac acc tgc tgc tgc tat gtc cga tgt ccc tgc tgc cca gac aag tgc 907 
His Thr Cys Cys Cys Tyr Val Arg Cys Pro Cys Cys Pro Asp Lys Cys 
215 220 225 230 

tgt tgc cct gag get ctt tat get get ggc aaa gca gcc acc tea ggt 955 
Cys Cys Pro Glu Ala Leu Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly 

^235 240 245 

gtc ccg age ate tat gcc ccc age ate tat acc cac etc tea cct gcc 1003 
Val Pro Ser lie Tyr Ala Pro Ser lie Tyr Thr His Leu Ser Pro Ala 

250 255 260 

aag acc cca cca cct ccg cct gcc atg att ccc atg ggc cct ccc tat 1051 
Lys Thr Pro Pro Pro Pro Pro Ala Met He Pro Met Gly Pro Pro Tyr 
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265 270 275 



ggg tac 


cct gga 


gac 


j_ i_ i_ 


gac 


aga 


cac 


age 


t- r> a 

tea 


gec 


ggt 


ggc 


cac 


age 




Gly Tyr 


Pro Gly 


ASp 


pne 


Asp 


Arg 


H13 




Ser 


Wax 


\j±y 


\3±Y 


tlx S 


Cor* 




280 








Oflc 
Zo j 










? Q0 












tec caa 


gta ccc 


ctg 


ctg 


cgt 


gac 


gtg 


CIA t - 

y ci u 


ggc 




y 






y ad 


1147 


Ser Gin 


Val Pro 


Leu 


Leu 


Arg 


Asp 


vai 


Asp 


\3 j.y 


Ser 


vai 


Ser 


Ser 


r*i ii 

ulu 




295 






*i n A 
300 










inc. 










n n 




gta cga 


agt ggc 


tac 




ate 


cag 


get 


aac 


cag 


caa 


ga u 


gac 


tec 


aty 


1 1 QC, 


Val Arg 


Ser Gly 


Tyr 


Arg 


lie 


Gin 


Aia 


Asn 


tjin 


oin 


Asp 


Asp 


Ser 


Mec 








315 










1 O n 










IOC 
J Z D 






agg gtc 


eta tac 


tat 


atg 


gag 


aaa 


gag 


eta 


gec 


aac 


^ ^ ^ 


gac 


CCC 


tec 


1 *5 A 1 


Arg Val 


Leu Tyr 


Tyr 


Met 


Glu 


Lys 


Glu 


Leu 


Ala 


Asn 


pne 


Asp 


Pro 


Ser 






330 










335 










340 








cga cct 


ggc cct 


ccc 


aat 


ggc 


aga 


gtg 


gaa 


egg 


gee 


atg 


agt 


gaa 


gta 


1291 


Arg Pro 


Gly Pro 


Pro 


Asn 


Gly 


Arg 


Val 


Glu 


Arg 


Ala 


Met 


Ser 


Glu 


Val 




345 








350 










355 










acc tec 


etc cat 


gaa 


gat 


gac 


tgg 


cga 


teg 


agg 


cct 


tec 


agg 


get 


cct 


1339 


Thr Ser 


Leu His 


Glu 


Asp 


Asp 


Trp 


Arg 


Ser 


Arg 


Pro 


Ser 


Arg 


Ala 


Pro 




360 








365 










370 












gec etc 


acc ccc 


ate 


agg 


gat 


gag 


gag 


tgg 


aat 


cgc 


cac 


tec 


cca 


cag 


13 87 


Ala Leu 


Thr Pro 


lie 


Arg 


Asp 


Glu 


Glu 


Trp 


Asn 


Arg 


His 


Ser 


Pro 


Gin 




375 






380 










385 










390 




agt ccc 


aga aca 


tgg 


gag 


cag 


gaa 


CCC 


ctt 


caa 


gaa 


caa 


cca 


agg 


ggt 


1435 


Ser Pro 


Arg Thr 


Trp 


Glu 


Gin 


Glu 


Pro 


Leu 


Gin 


Glu 


Gin 


Pro 


Arg 


Gly 








395 










400 










405 






ggt tgg 


ggg tct 


gga 


cgc 


cct 


egg 


gee 


cgc 


tct 


gtg 


gat 


get 


eta 


gat 


1483 


Gly Trp 


Gly Ser 


Gly 


Arg 


Pro 


Arg 


Ala 


Arg 


Ser 


Val 


Asp 


Ala 


Leu 


Asp 






410 










415 










420 








gat ate 


aac egg 


cct 


ggc 


tec 


act 


gaa 


tea 


gga 


egg 


tct 


tct 


ccc 


cca 


1531 


Asp lie 


Asn Arg 


Pro 


Gly 


Ser 


Thr 


Glu 


Ser 


Gly 


Arg 


Ser 


Ser 


Pro 


Pro 






425 








430 










435 










agt agt 


gga egg 


aga 


gga 


egg 


gee 


tat 


gca 


cct 


cca 


aga 


agt 


cgc 


age 


1579 


Ser Ser 


Gly Arg 


Arg 


Gly 


Arg 


Ala 


Tyr 


Ala 


Pro 


Pro 


Arg 


Ser 


Arg 


Ser 




440 








445 










450 












egg gat 


gac etc 


tat 


gac 


ccg 


gac 


gat 


cct 


agg 


gac 


ttg 


cca 


cat 


tec 


1627 


Arg Asp 


Asp Leu 


Tyr 


Asp 


Pro 


Asp 


Asp 


Pro 


Arg 


Asp 


Leu 


Pro 


His 


Ser 




455 






460 










465 










470 




cga gat 


ccc cac 


tat 


tat 


gac 


gac 


ate 


agg 


tct 


aga 


gat 


cca 


cgt 


get 


1675 


Arg Asp 


Pro His 


Tyr 


Tyr 


Asp 


Asp 


lie 


Arg 


Ser 


Arg 


Asp 


Pro 


Arg 


Ala 








475 










480 










485 






gac ccc 


aga tec 


cgt 


cag 


cga 


tec 


cga 


gat 


cct 


egg 


gat 


get 


ggc 


ttc 


1723 


Asp Pro 


Arg Ser 


Arg 


Gin 


Arg 


Ser 


Arg 


Asp 


Pro 


Arg 


Asp 


Ala 


Gly 


Phe 






490 










495 










500 








agg tea 


agg gac 


cct 


cag 


tat 


gat 


ggg 


cga 


eta 


tta 


gaa 


gag 


get 


tta 


1771 


Arg Ser 


Arg Asp 


Pro 


Gin 


Tyr 


Asp 


Gly 


Arg 


Leu 


Leu 


Glu 


Glu 


Ala 


Leu 






505 








510 










ETC 

bib 










aag aaa 


aag ggg 


teg 


ggc 


gag 


aga 


agg 


agg 


gtt 


tac 


agg 


gag 


gaa 


gaa 


1819 


Lys Lys 


Lys Gly 


Ser 


Gly 


Glu 


Arg 


Arg 


Arg 


Val 


Tyr 


Arg 


Glu 


Glu 


Glu 




con 
b2 Q 


















530 












gag gaa 


gag gag 


ggc 


caa 


cac 


ccc 


cca 


gca 


cct 


cca 


cct 


tac 


tea 


gag 


10 o / 


Glu Glu 


Glu Glu 


Gly 


Gin 


Tyr 


Pro 


Pro 


Ala 


Pro 


Pro 


Pro 


Tyr 


Ser 


Glu 




535 






540 










545 










550 




act gac 


teg cag 


gee 


tea 


egg 


gag 


agg 


agg 


ctg 


aaa 


aag 


aat 


ttg 


gee 


1915 


Thr Asp 


Ser Gin 


Ala 


Ser 


Arg 


Glu 


Arg 


Arg 


Leu 


Lys 


Lys 


Asn 


Leu 


Ala 








555 










560 










565 






ctg agt 


egg gaa 


agt 


tta 


gtc 


gtc 


tga 


tccacgtttt gtatgtagct 




1962 


Leu Ser 


Arg Glu 


Ser 


Leu 


Val 


Val 


* 



















570 575 



tttgtacttt ttttttaatt ggaatcaata ttgatgaaac ttcaagecta ataaaatgtc 2022 
taatcacaaa aaaaaaaa 2040 
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<212> PRT 

<213> Rattus norvegicus 



<400> 11 



Met 


Ala 


Pro 


Ala 


Ala 


Gly 


Ala 


Cys 


1 








5 








Ala 


Thr 


Val 


Val 


Phe 


Val 


Cys 


Leu 








20 










Pro 


Ala 


Ser 


Ala 


lie 


Gin 


Val 


Thr 






35 










40 


lie 


Leu 


Phe 


Gin 


Pro 


Val 


Thr 


Leu 




50 










55 




Thr 


Leu 


Thr 


Val 


Pro 


He 


Val 


He 


65 










70 


• 




Asp Arg 


lie 


Ala 


Asp 


Ala 


Phe 


Ser 










85 








Asn 


Ala 


Gin 


Leu 


Ala 


Ala 


Gly 


Asn 








100 










Cys 


Gin 


Asp 


Ser 


Val 


Arg 


Thr 


Val 






115 










120 


Asn 


Ala 


Val 


Thr 


Leu 


Gly 


Asp 


Tyr 




130 










135 




Thr Gly 


Asn 


Ala 


Asp 


Leu 


Thr 


Phe 


145 










150 






Gly Val 


Tyr 


Tyr 


Cys 


Ser 


Val 


Val 










165 








Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 


Leu 


He 








180 










Val 


Val 


Cys 


Leu 


Ala 


Ser 


Leu 


Leu 






195 










200 


Trp 


Cys 


Gin 


Cys 


Cys 


Pro 


His 


Thr 




210 










215 




Cys 


Cys 


Pro 


Asp 


Lys 


Cys 


Cys 


Cys 


225 










230 






Lys 


Ala 


Ala 


Thr 


Ser 


Gly 


Val 


Pro 










245 








Thr 


His 


Leu 


Ser 


Pro 


Ala 


Lys 


Thr 








260 










Pro 


Met 


Gly 


Pro 


Pro 


Tyr 


Gly 


Tyr 






275 










280 


Ser 


Val 


Gly 


Gly 


His 


Ser 


Ser 


Gin 




290 










295 




Gly Ser 


Val 


Ser 


Ser 


Glu 


Val 


Arg 


305 










310 






Gin 


Gin 


Asp 


Asp 


Ser 


Met 


Arg 


Val 










325 








Ala 


Asn 


Phe 


Asp 


Pro 


Ser 


Arg 


Pro 








340 










Arg 


Ala 


Met 


Ser 


Glu 


Val 


Thr 


Ser 






355 










360 


Arg 


Pro 


Ser 


Arg 


Ala 


Pro 


Ala 


Leu 




370 










375 




Asn Arg 


His 


Ser 


Pro 


Gin 


Ser 


Pro 


385 










390 






Gin 


Glu 


Gin 


Pro 


Arg 


Gly 


Gly 


Trp 










405 








Ser 


Val 


Asp 


Ala 


Leu 


Asp 


Asp 


He 








420 










Gly Arg 


Ser 


Ser 


Pro 


Pro 


Ser 


Ser 






435 










440 


Pro 


Pro 


Arg 


Ser 


Arg 


Ser 


Arg 


Asp 




450 










455 




Arg Asp 


Leu 


Pro 


His 


Ser 


Arg 


Asp 



Ala 


Gly 


Ala 


Pro 


Asp 


Ser 


His 


Pro 




10 










15 




Phe 


Leu 


He 


He 


Phe 


Cys 


Pro 


Asp 


25 










30 






Val 


Ser 


Asp 


Pro 


Tyr 


His 


Val 


Val 










45 








Pro 


Cys 


Thr 


Tyr 


Gin 


Met 


Ser 


Asn 








60 










Trp 


Lys 


Tyr 


Lys 


Ser 


Phe 


Cys 


Arg 






75 










80 


Pro 


Ala 


Ser 


Val 


Asp 


Asn 


Gin 


Leu 




90 










95 




Pro 


Gly 


Tyr 


Asn 


Pro 


Tyr 


Val 


Glu 


105 










110 






Arg 


Val 


Val 


Ala 


Thr 


Lys 


Gin 


Gly 










125 








Tyr 


Gin 


Gly 


Arg 


Arg 


He 


Thr 


He 








140 










Glu 


Gin 


Thr 


Ala 


Trp 


Gly 


Asp 


Ser 






155 










160 


Ser 


Ala 


Gin 


Asp 


Leu 


Asp 


Gly 


Asn 




170 










175 




Val 


Leu 


Asp 


Trp 


Leu 


Phe 


Val 


Val 


185 
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Leu 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Cys 










205 








Cys 


Cys 


Cys 


Tyr 


Val 


Arg 


Cys 


Pro 








220 










Pro 


Glu 


Ala 


Leu 


Tyr 


Ala 


Ala 


Gly 






235 










240 


Ser 


He 


Tyr 


Ala 


Pro 


Ser 


He 


Tyr 




250 










255 




Pro 


Pro 


Pro 


Pro 


Pro 


Ala 


Met 


He 


265 










270 






Pro 


Gly 


Asp 


Phe 


Asp 


Arg 


His 


Ser 










285 








Val 


Pro 


Leu 


Leu 


Arg 


Asp 


Val 


Asp 








300 










Ser 


Gly 


Tyr 


Arg 


He 


Gin 


Ala 


Asn 






315 










320 


Leu 


Tyr 


Tyr 


Met 


Glu 


Lys 


Glu 


Leu 




330 










335 




Gly 
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Pro 


Asn 
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Arg 


Val 


Glu 


345 










350 
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Glu 
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Asp 


Trp 


Arg 
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He 


Arg 
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Glu 


Glu 


Trp 
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Pro 
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Arg 
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Arg 
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Asp 


Pro 
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He 


Arg 
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465 




470 






475 










480 


Ser Arg Asp 


Pro Arg 


Ala 


Asp 


Pro Arg Ser 
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Ser 
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Asp 
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Pro Arg Asp 
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Phe 


Arg 


Ser Arg Asp 


Pro 


Gin 


Tyr 


Asp 


Gly 


Arg 




500 






505 








510 






Leu Leu Glu 


Glu Ala Leu Lys 


jjys jjys v»xy 


Car 


\j±y 




Arg 


Arg 


Arg 


515 








520 






525 








Val Tyr Arg 


Glu Glu 


Glu 


Glu 


Glu Glu Glu 


Gly 


Gin 


Tyr 


Pro 


Pro 


Ala 


530 






535 






540 










Pro Pro Pro 


Tyr Ser 


Glu 


Thr 


Asp Ser Gin 


Ala 


Ser 


Arg 


Glu 


Arg 


Arg 


545 




550 






555 










560 


Leu Lys Lys 


Asn Leu 


Ala 


Leu 


Ser Arg Glu 


Ser 


Leu 


Val 


Val 







565 570 



<210> 12 
<211> 1893 
<212> DNA 

<213> Rattus norvegicus 
<400> 12 

accgctcacc aggtcagttg tccccggaaa gccgaaggca tgagcttcgc ccaagttctt 60 
tttatgggtt agaactcctc cagagcgggg gaaaaaggac ttggaatagg ggcgggacgg 120 
agcacgcacc cttctccgcc ttggttctcg ccgcgccccc tactctcggg atacttggga 180 
ggggacgcgc gggcaccgtc gctgctagac ggccgcg atg gcg ccg gcg gcc ggc 235 

Met Ala Pro Ala Ala Gly 
1 5 



gcg 


tgt 


get 


ggg 


gcg 


cct 


gac 


tec 


cac 


cca 


get 


acc 


gtg 


gtc 


ttc 


gtg 


283 


Ala 


Cys 


Ala 


Gly 


Ala 


Pro 


Asp 


Ser 


His 


Pro 


Ala 


Thr 


Val 


Val 


Phe 


Val 










10 










15 










20 








tgt 


etc 


ttt 


etc 


ate 


att 


ttc 


tgc 


cca 


gac 


cct 


gcc 


agt 


gcc 


ate 


cag 


331 


Cys 


Leu 


Phe 


Leu 


He 


He 


Phe 


Cys 


Pro 


Asp 


Pro 


Ala 


Ser 


Ala 


He 


Gin 








25 










30 










35 










gtg 


act 


gtg 


tct 


gac 


ccc 


tac 


cac 


gta 


gtg 


ate 


ctg 


ttc 


cag 


cca 


gtg 


379 


Val 


Thr 


Val 


Ser 


Asp 


Pro 


Tyr 


His 


Val 


Val 


He 


Leu 


Phe 


Gin 


Pro 


Val 






40 










45 










50 












acc 


ctg 


ccc 


tgc 


acc 


tat 


cag 


atg 


age 


aac 


act 


etc 


aca 


gtc 


ccc 


ate 


427 


Thr 


Leu 


Pro 


Cys 


Thr 


Tyr 


Gin 


Met 


Ser 


Asn 


Thr 


Leu 


Thr 


val 


Pro 


He 




55 










60 










65 










70 




gtg 


ate 


tgg 


aag 


tac 


aag 


tea 


ttc 


tgc 


egg 


gac 


cgt 


att 


gcc 


gat 


gcc 


475 


Val 


He 


Trp 


Lys 


Tyr 


Lys 


Ser 


Phe 


Cys 


Arg 


Asp 


Arg 


He 


Ala 


Asp 


Ala 












75 










80 










85 






ttc 


tct 


cct 


gcc 


agt 


gtg 


gac 


aac 


cag 


eta 


aat 


gcc 


cag 


ttg 


gca 


get 


523 


Phe 


Ser 


Pro 


Ala 


Ser 


Val 


Asp 


Asn 


Gin 


Leu 


Asn 


Ala 


Gin 


Leu 


Ala 


Ala 










90 










95 










100 








ggc 


aac 


ccc 


ggc 


tac 


aac 


ccc 


tat 


gtg 


gag 


tgc 


cag 


gac 


agt 


gta 


cgc 


571 


Gly 


Asn 


Pro 


Gly 


Tyr 


Asn 


Pro 


Tyr 


Val 


Glu 


Cys 


Gin 


Asp 


Ser 


Val 


Arg 








105 










110 










115 










act 


gtc 


agg 


gtg 


gtg 


gcc 


acc 


aaa 


cag 


ggc 


aat 


gcg 


gtg 


acc 


ctg 


gga 


619 


Thr 


Val 


Arg 


Val 


Val 


Ala 


Thr 


Lys 


Gin 


Gly 


Asn 


Ala 


Val 


Thr 


Leu 


Gly 






120 










125 










130 












gac 


tac 


tac 


caa 


ggc 


agg 


agg 


ate 


acc 


ata 


aca 


gga 


aat 


get 


gac 


ctg 


667 


Asp 


Tyr 


Tyr 


Gin 


Gly 


Arg 


Arg 


He 


Thr 


He 


Thr 


Gly 


Asn 


Ala 


Asp 


Leu 




135 










140 










145 










150 




acc 


ttc 


gag 


cag 


aca 


gcc 


tgg 


gga 


gac 


agt 


gga 


gtg 


tat 


tac 


tgc 


tct 


715 


Thr 


Phe 


Glu 


Gin 


Thr 


Ala 


Trp 


Gly 


Asp 


Ser 


Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 












155 










160 










165 






gtg 


gtc 


teg 


gcc 


caa 


gat 


ctg 


gat 


gga 


aac 


aac 


gag 


gcg 


tac 


gca 


gag 


763 


Val 


Val 


Ser 


Ala 


Gin 


Asp 


Leu 


Asp 


Gly 


Asn 


Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 










170 










175 










180 








etc 


ate 


gtc 


ctt 


gtt 


tat 


get 


get 


ggc 


aaa 


gca 


gcc 


acc 


tea 


ggt 


gtc 


811 


Leu 


He 


Val 


Leu 


Val 


Tyr 


Ala 


Ala 


Gly 


Lys 


Ala 


Ala 


Thr 


Ser 


Gly 


Val 








185 










190 










195 










ccg 


age 


ate 


tat 


gcc 


ccc 


age 


ate 


tat 


acc 


cac 


etc 


tea 


cct 


gcc 


aag 


859 



WO 01/21647 PCT/IB00/01470 

44 



Pro 


Car 

ber 


Tl 0 

i le 


Tyr 


a. j. a 


rlO 


Cor 


lie 


Tyr 


1 ill. 


rid. a 


T.aii 
ucu 




Dfrt 

<rJL \J 


Al a 


T ,\rc 

uy o 






A Aft 

200 










o a c 
2. Ob 










•11 ft 
Z1U 












acc 


cca 


cca 


cct 


ccg 


cct 


gec 


aCy 




CCC 


Ato 


cine* 

ggc 


\~ V- u 


r* r* 


tat 


ggg 




Tnr 


pro 


pro 


Pro 


Pro 


Pro 


ax a 


Met- 
ric U 


Tip 

lie 


rl U 






tr i. \J 




iyr 


Ci\ v 
uxy 




215 




















Z z o 














tac 


cct 


gga 


gac 


LLL 


gac 


aga 


Cat 


age 


t" r*;a 


at- t- 

gtc 


ggt 


not* 




ay *w 




9^ 


Tyr 


Pro 


Gly 


T\ An 

ASp 


pile 


ASp 


Arg 


HIS 


Cat- 

ber 


Ser 


Val 


m \r 
\jXy 




Mi cs 

nis 


Ser 


Cor 












235 










240 










24 b 






caa 


gta 


ccc 


ctg 


ctg 


cgt 


gac 


gtg 


gar 


ggc 


art t- 

agc 


gta 


t- r-t- 
CCl 


tea 


gaa 




1UUJ 


Gin 


Val 


Pro 


Leu 


Leu 


Arg 


Asp 


Val 


Asp 


Gly 


ber 


vai 


ber 


C A V 

ber 


GlU 


vai 










250 










255 










260 








cga 


agt 


ggc 


tac 


agg 


ate 


cag 


get 


aac 


cag 


caa 


gat 


gac 


tec 


atg 


agg 


lUb 1 


Arg 


Ser 


Gly 


Tyr 


Arg 


He 


Gin 


Ala 


Asn 


Gin 


Gin 


Asp 


Asp 


Ser 


Met 


Arg 








265 










270 










275 










gtc 


eta 


tac 


tat 


atg 


gag 


aaa 


gag 


eta 


gee 


aac 


ttt 


gac 


cct 


tec 


cga 


T A O Q 

1099 


val 


Leu 


Tyr 


Tyr 


Met 


Glu 


Lys 


Glu 


Leu 


Ala 


Asn 


Phe 


Asp 


Pro 


Ser 


Arg 






280 










285 










290 












cct 


ggc 


cct 


ccc 


aat 


ggc 


aga 


gtg 


gaa 


egg 


gee 


atg 


agt 


gaa 


gta 


acc 


1147 


Pro Gly 


Pro 


Pro 


Asn 


Gly 


Arg 


Val 


Glu 


Arg 


Ala 


Met 


Ser 


Glu 


Val 


Thr 




295 










300 










305 










310 




tec 


etc 


cat 


gaa 


gat 


gac 


tgg 


cga 


teg 


agg 


cct 


tec 


agg 


get 


cct 


gee 


119b 


Ser 


Leu 


His 


Glu 


Asp 


Asp 


Trp 


Arg 


Ser 


Arg 


Pro 


Ser 


Arg 


Ala 


Pro 


Ala 












315 










320 










325 






etc 


acc 


CCC 


ate 


agg 


gat 


gag 


gag 


tgg 


aat 


cgc 


cac 


tec 


cca 


cag 


agt 


124 J 


Leu 


Thr 


Pro 


He 


Arg 


Asp 


Glu 


Glu 


Trp 


Asn 


Arg 


His 


Ser 


Pro 


Gin 


ser 










330 










335 










340 








ccc 


aga 


aca 


tgg 


gag 


cag 


gaa 


ccc 


ctt 


caa 


gaa 


caa 


cca 


agg 


ggt 


ggt 


1291 


Pro Arg 


Thr 


Trp 


Glu 


Gin 


Glu 


Pro 


Leu 


Gin 


Glu 


Gin 


Pro 


Arg 


Gly 


Gly 








345 










350 










355 










tgg 


ggg 


tct 


gga 


cgc 


cct 


egg 


gee 


cgc 


tct 


gtg 


gat 


get 


eta 


gat 


gat 


1339 


Trp Gly 


Ser 


Gly 


Arg 


Pro 


Arg 


Ala 


Arg 


Ser 


Val 


Asp 


Ala 


Leu 


Asp 


Asp 






360 










365 










370 












ate 


aac 


egg 


cct 


ggc 


tec 


act 


gaa 


tea 


gga 


egg 


tct 


tct 


ccc 


cca 


agt 


1387 


He 


Asn 


Arg 


Pro 


Gly 


Ser 


Thr 


Glu 


Ser 


Gly 


Arg 


Ser 


Ser 


Pro 


Pro 


Ser 




375 










380 










385 










390 




agt 


gga 


egg 


aga 


gga 


egg 


gec 


tat 


gca 


cct 


cca 


aga 


agt 


cgc 


age 


egg 


1435 


Ser Gly 


Arg 


Arg 


Gly 


Arg 


Ala 


Tyr 


Ala 


Pro 


Pro 


Arg 


Ser 


Arg 


Ser 


Arg 












395 










400 










405 






gat 


gac 


etc 


tat 


gac 


ccg 


gac 


gat 


cct 


agg 


gac 


ttg 


cca 


cat 


tec 


cga 


1483 


Asp Asp 


Leu 


Tyr 


Asp 


Pro 


Asp 


Asp 


Pro 


Arg 


Asp 


Leu 


Pro 


His 


Ser 


Arg 










410 










415 










420 








gat 


ccc 


cac 


tat 


tat 


gac 


gac 


ate 


agg 


tct 


aga 


gat 


cca 


cgt 


get 


gac 


1531 


Asp 


Pro 


Has 


Tyr 


Tyr 


Asp 


Asp 


He 


Arg 


Ser 


Arg 


Asp 


Pro 


Arg 


T4 1 — 

Ala 


Asp 








425 










4 30 










435 










ccc 


aga 


tec 


cgt 


cag 


cga 


tec 


cga 


gat 


cct 


egg 


gat 


get 


ggc 


ttc 


agg 


1 C "T O 

1579 


Pro Arg 


Ser 


Arg 


Gin 


Arg 


Ser 


Arg 


Asp 


Pro 


A -v r~t 

Arg 


ASp 


Ala 


Gly 


pne 


Arg 






440 










445 










450 












tea 


agg 


gac 


cct 


cag 


tat 


gat 


ggg 


cga 


eta 


tta 


gaa 


gag 


get 


tta 


aag 


162 7 


Ser Arg 


Asp 


Pro 


Gin 


Tyr 


Asp 


Gly 


Arg 


Leu 


Leu 


Glu 


Glu 


Ala 


Leu 


Lys 




455 










A C (\ 

460 










A £ C 

465 










A H f\ 

47 0 




aaa 


aag 


ggg 


teg 


ggc 


gag 


aga 


agg 


agg 


gtt 


cac 


agg 


gag 


gaa 


gaa 


gag 


lb /b 


Lys 


Lys 


Giy 


ber 


Gly 


G1U 


Arg 


A -v-^-» 

Arg 


Arg 


vai 


Tyr 


Arg 


GlU 


Glu 


GlU 


GlU 












475 










480 










485 






gaa 


gag 


gag 


ggc 


caa 


tac 


ccc 


cca 


gca 


cct 


cca 


cct 


tac 


tea 


gag 


act 


i Tin 
1 / Z $ 


Glu 


Glu 


G1U 


Giy 


Gin 


Tyr 


Pro 


Pro 


Ala 


Pro 


Pro 


Pro 


Tyr 


Ser 


Glu 


Thr 










490 










495 










500 








gac 


teg 


cag 


gee 


tea 


egg 


gag 


agg 


agg 


ctg 


aaa 


aag 


aat 


ttg 


gec 


ctg 


1771 


Asp 


Ser 


Gin 


Ala 


Ser 


Arg 


Glu 


Arg 


Arg 


Leu 


Lys 


Lys 


Asn 


Leu 


Ala 


Leu 








505 










510 










515 










agt 


egg 


gaa 


agt 


tta 


gtc 


gtc 


tga 


tccacgtttt gtatgtagct tttgtacttt 


1825 


Ser Arg 


Glu 


Ser 


Leu 


Val 


val 


* 





















520 525 
ttttttaatt ggaatcaata ttgatgaaac ttcaagecta ataaaatgtc taatcacaaa 1885 



WO 01/21647 PCT/1B00/01470 

45 

aaaaaaaa 1893 

<210> 13 
<211> 525 
<212> PRT 

<213> Rattus norvegicus 



<400> 13 



Met 


Ala 


Pro 


Ala 


1 








Ala 


Thr 


val 


Val 








20 


Pro 


Ala 


Ser 


Ala 






35 




He 


Leu 


Phe 

rue 


Gin 




50 






Thr 


Leu 


Thr 


Val 


65 








Asp Arg 


He 


Ala 


Asn 


Ala 


Gin 


Leu 








100 


Cys 


Gin 


Asp 


Ser 






115 




Asn 


Ala 


Val 


Thr 




130 






Thr Gly 


Asn 


Ala 


145 








Gly Val 


Tyr 


Tyr 


Asn 


Glu 


Ala 


Tyr 








180 


Ala 


Ala 


Thr 


Ser 






195 




His 


Leu 


Ser 


Pro 




210 






Met Gly 


Pro 


Pro 


225 








Val Gly 


Gly 


His 


Ser 


Val 


Ser 


Ser 








260 


Gin Asp 


Asp 


Ser 






275 




Asn 


Phe 


Asp 


Pro 




290 






Ala 


Met 


Ser 


Glu 


305 








Pro 


Ser 


Arg 


Ala 


Arg 


His 


Ser 


Pro 








340 


Glu 


Gin 


Pro 


Arg 






355 




Val 


Asp 


Ala 


Leu 




370 






Arg 


Ser 


Ser 


Pro 


385 








pro Arg 


Ser 


Arg 


Asp 


Leu 


Pro 


His 








420 


Arg 


Asp 


Pro 


Arg 



Ala 


Gly Ala 


Cys 


5 








Phe 


Val 


Cys 


Leu 


He 


Gin 


Val 


Thr 








40 


Pro 


Val 


Thr 


Leu 






55 




r l u 


He 


Val 


He 




70 






Asp 


Ala 


Phe 


Ser 


85 








Ala 


Ala Gly 


Asn 


Val 


Arg 


Thr 


Val 








120 


Leu 


Gly Asp 


Tyr 






135 




Asp 


Leu 


Thr 


Phe 




150 






Cys 


Ser 


Val 


Val 


165 








Ala 


Glu 


Leu 


He 


Gly 


Val 


Pro 


Ser 








200 


Ala 


Lys 


Thr 


Pro 






215 




Tvr 


Gly Tyr 


Pro 




230 






Ser 


Ser 


Gin 


Val 


245 








Glu 


Val 


Arg 


Ser 


Met 


Arg Val 


Leu 








280 


Ser 


Arg 


Pro 


Gly 






295 




Val 


Thr 


Ser 


Leu 




310 






Pro 


Ala 


Leu 


Thr 


325 








Gin 


Ser 


Pro 


Arg 


Gly 


Gly Trp 


Gly 








360 


Asp 


Asp 


He 


Asn 






375 




Pro 


Ser 


Ser 


Gly 




390 






Ser 


Arg 


Asp 


Asp 


405 








Ser 


Arg Asp 


Pro 


Ala 


Asp 


Pro 


Arg 



Ala 


Gly Ala 


Pro 




10 






Phe 


Leu 


He 


He 


25 








Val 


Ser Asp 


Pro 


Pro 


Cys 


Thr 


Tyr 








60 


Trp 


Lys 


Tyr 


Lys 






75 




Pro 


Ala 


Ser 


Val 




90 






Pro 


Gly Tyr 


Asn 


105 








Arg 


Val 


Val 


Ala 


Tyr 


Gin Gly 


Arg 








140 


Glu 


Gin 


Thr 


Ala 






155 




Ser 


Ala 


Gin 


Asp 




170 






Val 


Leu 


Val 


Tyr 


185 








He 


Tyr 


Ala 


Pro 


Pro 


Pro 


Pro 


Pro 








220 


Gly 


Asp 


Phe 


Asp 






235 




Pro 


Leu 


Leu 


Arg 




250 






Gly 


Tyr 


Arg 


He 


265 








Tyr 


Tyr 


Met 


Glu 


Pro 


Pro 


Asn 


Gly 








300 


His 


Glu 


Asp 


Asp 






315 




Pro 


He 


Arg 


Asp 




330 






Thr 


Trp 


Glu 


Gin 


345 








Ser 


Gly Arg 


Pro 


Arg 


Pro Gly 


Ser 








380 


Arg 


Arg Gly 


Arg 






395 




Leu 


Tyr Asp 


Pro 




410 






His 


Tyr Tyr 


Asp 


425 








Ser 


Arg Gin 


Arg 



Asp Ser His 


Pro 


15 




Phe Cys Pro 


Asp 


30 




Tyr His Val 


Val 


45 




Gin Met Ser 


Asn 


Ser Phe Cys 


Arg 




80 


Asp Asn Gin 


Leu 


95 




Pro Tyr Val 


Glu 


110 




Thr Lys Gin 


Gly 


125 




Arg He Thr 


He 


Trp Gly Asp 


Ser 




160 


Leu Asp Gly 


Asn 


175 




Ala Ala Gly 


Lys 


190 




Ser He Tyr 


Thr 


205 




Ala Met He 


Pro 


Arg His Ser 


Ser 




240 


Asp Val Asp 


Gly 


255 




Gin Ala Asn 


Gin 


270 




Lys Glu Leu 


Ala 


285 




Arg Val Glu 


Arg 


Trp Arg Ser 


Arg 




320 


Glu Glu Trp 


Asn 


335 




Glu Pro Leu 


Gin 


350 




Arg Ala Arg 


Ser 


365 




Thr Glu Ser 


Gly 


Ala Tyr Ala 


Pro 




400 


Asp Asp Pro 


Arg 


415 




Asp He Arg 


Ser 


430 




Ser Arg Asp 


Pro 



WO 01/21647 PCT/IBOO/01470 

46 

435 440 445 



Arg Asp 


Ala 


Gly 


Phe 


Arg 


Ser 


Arg 


Asp 


Pro 


uxn 


Tyr 


Asp 


vji.y 


Arg 


Leu 




450 










455 










4 60 












Leu Glu 


Glu 


Ala 


Leu 


Lys 


Lys 


Lys 


Gly 


Ser 


Gly 


Glu 


Arg 


Arg 


Arg 


Val 




465 








470 










475 










480 




Tyr Arg 


Glu 




Glu 


Glu 


Glu 


UlU 


VJJL U 


vji y 




TVr 

iyr 


Dm 




ala 

rix a 










485 




















H 5 J 






Pro Pro 


Tyr 


Ser 


Glu Thr Asp 


Ser 


m « 
uin 


a j. a 


Ser 


Arg 




Arg 


Arg 


Leu 








500 










505 










510 








Lys Lys 


Asn 


Leu 


Ala 


Leu 


Ser 


Arg 


Glu 


Ser 


Leu 


val 


Val 












515 










520 










C 1 c 

b^b 










<210> 14 






























<211> 1886 






























<212> DNA 






























<213> Mus musculus 


























<400> 14 






























gcaccgtcgc tgctagacgg ccgcg 


atg 


gcg 


ccg 


gcg 


gec 


age 


gcg 


tgt 


get 


52 














Met 
1 


Ala 


Pro 


Ala 


Ala 
5 


Ser 


Ala 


Cys 


Ala 




ggg gcg 


cct 


ggc 


tec 


cac 


ccg 


gec 


acc 


acg 


ate 


ttc 


gtg 


tgt 


ctt 


ttt 


100 


Gly Ala 


Pro 


Gly 


Ser 


His 


Pro 


Ala 


Thr 


Thr 


He 


Phe 


Val 


Cys 


Leu 


Phe 




10 








15 










20 










25 




etc ate 


att 


tac 


tgc 


cca 


gac 


cgt 


gec 


agt 


gec 


ate 


cag 


gtg 


acc 


gtg 


148 


Leu lie 


He 


Tyr 


Cys 


Pro 


Asp 


Arg 


Ala 


Ser 


Ala 


He 


Gin 


Val 


Thr 


Val 










30 










35 










40 






cct gac 


ccc 


tac 


cac 


gta 


gtg 


ate 


ctg 


ttc 


cag 


cca 


gtg 


aca 


eta 


cac 


196 


Pro Asp 


Pro 


Tyr 


His 


Val 


Val 


He 


Leu 


Phe 


Gin 


Pro 


Val 


Thr 


Leu 


His 








45 










50 










55 








tgc ace 


tac 


cag 


atg 


age 


aat 


acc 


etc 


aca 


gee 


cct 


ate 


gtg 


ate 


tgg 


244 


Cys Thr 


Tyr 


Gin 


Met 


Ser 


Asn 


Thr 


Leu 


Thr 


Ala 


Pro 


He 


Val 


He 


Trp 






60 










65 










70 










aag tat 


aag 


teg 


ttc 


tgt 


egg 


gac 


cgt 


gtt 


gec 


gac 


gec 


ttc 


tec 


cct 


292 


Lys Tyr 


Lys 


Ser 


Phe 


Cys 


Arg 


Asp 


Arg 


val 


Ala 


Asp 


Ala 


Phe 


Ser 


Pro 




75 










80 










85 












gec age 


gtg 


gac 


aac 


cag 


etc 


aac 


gee 


cag 


ctg 


gcg 


get 


ggc 


aac 


ccc 


340 


Ala Ser 


Val 


Asp 


Asn 


Gin 


Leu 


Asn 


Ala 


Gin 


Leu 


Ala 


Ala 


Gly 


Asn 


Pro 




90 








95 










100 










10b 




ggc tac 


aac 


ccc 


tat 


gtg 


gag 


tgc 


cag 


gac 


age 


gta 


cgc 


act 


gtc 


agg 


388 


Gly Tyr 


Asn 


Pro 


Tyr 


Val 


Glu 


Cys 


Gin 


Asp 


Ser 


Val 


Arg 


Thr 


Val 


Arg 










110 










115 










12 0 






gtg gtg 


gec 


acc 


aaa 


cag 


ggc 


aat 


get 


gtg 


acc 


ctg 


gga 


gac 


tac 


tac 


436 


Val Val 


Ala 


Thr 


Lys 


Gin 


Gly 


Asn 


Ala 


val 


Thr 


Leu 


Gly 


Asp 


Tyr 


Tyr 








125 










130 










13 b 








cag ggc 


agg 


aga 


ate 


acc 


ate 


aca 


gga 


aat 


get 


gg c 


ctg 


acc 


ttc 


gag 


a a a 
484 


Gin Gly 


Arg 


Arg 


He 


Thr 


He 


Thr 


Gly 


Asn 


TV 1 -i 

Ala 


Gly 


Leu 


Tnr 


Phe 


CjIU 






140 










x*i b 










1 jU 










cag acg 


gec 


tgg 


gga 


gac 


agt 


gga 


gtg 


rat 


tac 


tgc 


tec 


gtg 


gtc 


t* r* a 

cca 


b3 z 


Gin Thr 


Ala 


Trp 


Gly 


Asp 


Ser 


/-i i , , 
\jiy 


vai 


Tyr 


Tyr 


Cys 


oer 


vai 


vai 


Ser 




155 










160 










lob 












gee caa 


gat 


ctg 


gat 


ggg 


aac 


aac 


gag 


gcg 


tac 


gca 


gag 


etc 


att 


gtc 


580 


Ala Gin 


Asp 


Leu 


Asp 


Gly 


Asn 


Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 


Leu 


lie 


val 




170 








175 










"i o 
1 o U 










lob 




ctt ggc 


agg 


acc 


tea 


gaa 


gec 


cct 


gag 


etc 


eta 


cct 


ggt 


ttt 


egg 


gcg 


628 


Leu Gly 


Arg 


Thr 


Ser 


Glu 


Ala 


Pro 


Glu 


Leu 


Leu 


Pro 


Gly 


Phe 


Arg 


Ala 










190 










195 










200 






ggg ccc 


ttg 


gaa 


gat 


tgg 


etc 


ttt 


gtg 


gtc 


gtg 


gtc 


tgc 


ctg 


gca 


age 


676 


Gly Pro 


Leu 


Glu 


Asp 


Trp 


Leu 


Phe 


Val 


Val 


Val 


val 


Cys 


Leu 


Ala 


Ser 








205 










210 










215 








etc etc 


ttc 


ttc 


etc 


etc 


ctg 


ggc 


ate 


tgc 


tgg 


tgc 


cag 


tgc 


tgt 


ccc 


724 


Leu Leu 


Phe 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Cys 


Trp 


Cys 


Gin 


Cys 


Cys 


Pro 






220 










225 










230 











WO 01/21647 



cac 


acc 


tgc 


tgc 


tgc 


tat 


His 


Thr 


Cys 


Cys 


Cys 


Tyr 




235 










tgt 


tgc 


cct 


gag 


gec 


ctt 


Cys Cys 


Pro 


Glu 


Ala 


Leu 


250 










255 


gtg 


cca 


age 


ate 


tat 


gee 


Val 


Pro 


Ser 


He 


Tyr 


Ala 










270 




aag 


act 


ccg 


cca 


cct 


ccg 


Lys 


Thr 


Pro 


Pro 


Pro 


Pro 








285 






ggg 


tac 


cct 


gga 


gac 


ttt 


Gly Tyr 


Pro 


Gly 


Asp 


Phe 






300 








tec 


cag 


gtg 


ccc 


ctg 


ctg 


Ser 


Gin 


Val 


Pro 


Leu 


Leu 




315 










gta 


cga 


agt 


ggc 


tac 


agg 


Val 


Arg 


Ser 


Gly 


Tyr 


Arg 


330 










335 


agg 


gtc 


eta 


tac 


tat 


atg 


Arg 


Val 


Leu 


Tyr 


Tyr 


Met 










350 




egg 


cct 


ggc 


cct 


ccc 


aat 


Arg 


Pro 


Gly 


Pro 


Pro 


Asn 








365 






acc 


tec 


etc 


cat 


gaa 


gat 


Thr 


Ser 


Leu 


His 


Glu 


Asp 






380 








gee 


etc 


aca 


ccc 


ate 


agg 


Ala 


Leu 


Thr 


Pro 


He 


Arg 




395 










agt 


ccc 


aga 


aca 


tgg 


gag 


Ser 


Pro 


Arg 


Thr 


Trp 


Glu 


410 










415 


ggt 


tgg 


ggg 


tct 


ggg 


egg 


Gly Trp 


Gly 


Ser 


Gly 


Arg 










430 




gac 


ate 


aac 


egg 


cct 


ggc 


Asp 


lie 


Asn 


Arg 


Pro 


Gly 








445 






agt 


agt 


gga 


egg 


aga 


ggg 


Ser 


Ser 


Gly 


Arg 


Arg 


Gly 






460 








egg 


gat 


gac 


etc 


tat 


gac 


Arg Asp 


Asp 


Leu 


Tyr 


Asp 




475 










cga 


gat 


ccc 


cac 


tat 


tat 


Arg Asp 


Pro 


His 


Tyr 


Tyr 


490 










495 


gac 


ccc 


aga 


tec 


cgt 


cag 


Asp 


Pro 


Arg 


Ser 


Arg 


Gin 










510 




agg 


tea 


egg 


gac 


cct 


cag 


Arg 


Ser 


Arg 


Asp 


Pro 


Gin 








525 






aag 


aaa 


aaa 


ggg 


get 


ggg 


Lys 


Lys 


Lys 


Gly 


Ala 


Gly 






540 








gaa 


gaa 


gaa 


gag 


gag 


ggc 


Glu 


Glu 


Glu 


Glu 


Glu 


Gly 



555 



47 



gtc 


aga 


tgt 


ccc 


tgc 


tgc 


Val 


Arg 


Cys 


Pro 


Cys 


Cys 


240 










245 


tat 


get 


get 


ggc 


aaa 


gca 


Tyr 


Ala 


Ala 


Gly 


Lys 


Ala 










260 




ccc 


age 


ate 


tat 


acc 


cac 


Pro 


Ser 


He 


Tyr 


Thr 


His 








275 






cct 


gec 


atg 


att 


ccc 


atg 


Pro 


Ala 


Met 


He 


Pro 


Met 






290 








gac 


agg 


acc 


age 


tea 


gtt 


Asp 


Arg 


Thr 


Ser 


Ser 


Val 




305 










cgt 


gaa 


gtg 


gat 


ggg 


age 


Arg 


Glu 


Val 


Asp 


Gly 


Ser 


320 










325 


ate 


cag 


get 


aac 


cag 


caa 


He 


Gin 


Ala 


Asn 


Gin 


Gin 










340 




gag 


aag 


gag 


eta 


gee 


aac 


Glu 


Lys 


Glu 


Leu 


Ala 


Asn 








355 






ggc 


cga 


gtg 


gaa 


egg 


gee 


Gly 


Arg 


Val 


Glu 


Arg 


Ala 






370 








gac 


tgg 


cga 


tct 


egg 


cct 


Asp 


Trp 


Arg 


Ser 


Arg 


Pro 




385 










gat 


gag 


gag 


tgg 


aat 


cgc 


Asp 


Glu 


Glu 


Trp 


Asn 


Arg 


400 










405 


cag 


gaa 


ccc 


ctt 


caa 


gaa 


Gin 


Glu 


Pro 


Leu 


Gin 


Glu 










420 




cct 


egg 


gee 


cgc 


tct 


gtg 


Pro 


Arg 


Ala 


Arg 


Ser 


Val 








435 






tec 


act 


gaa 


tea 


gga 


agg 


Ser 


Thr 


Glu 


Ser 


Gly 


Arg 






450 








egg 


gee 


tat 


gca 


cct 


ccg 


Arg 


Ala 


Tyr 


Ala 


Pro 


Pro 




465 










ccc 


gac 


gat 


cct 


aga 


gac 


Pro 


Asp 


Asp 


Pro 


Arg 


Asp 


480 










485 


gat 


gat 


ttg 


agg 


tct 


agg 


Asp 


Asp 


Leu 


Arg 


Ser 


Arg 










500 




cga 


tec 


cac 


gat 


cct 


egg 


Arg 


Ser 


His 


Asp 


Pro 


Arg 








515 






tat 


gat 


ggg 


cga 


etc 


tta 


Tyr 


Asp 


Gly 


Arg 


Leu 


Leu 






530 








gag 


aga 


aga 


cgc 


gtt 


tac 


Glu 


Arg 


Arg 


Arg 


Val 


Tyr 




545 










cac 


tat 


ccc 


cca 


gca 


cct 


His 


Tyr 


Pro 


Pro 


Ala 


Pro 


560 










565 



PCT/IBOO/01470 



cca gac aag tgc 772 
Pro Asp Lys Cys 

gee acc tea ggt 820 
Ala Thr Ser Gly 
265 

etc tct cct gee 868 
Leu Ser Pro Ala 
280 

cgt cct ccc tat 916 
Arg Pro Pro Tyr 
295 

ggt ggc cac age 964 

Gly Gly His Ser 

310 

gta tct tea gaa 1012 
Val Ser Ser Glu 

gat gac tec atg 1060 
Asp Asp Ser Met 
345 

ttc gat cct tec 1108 
Phe Asp Pro Ser 
360 

atg agt gaa gta 1156 
Met Ser Glu Val 
375 

tec agg get cct 1204 

Ser Arg Ala Pro 

390 

cac tec cct egg 1252 
His Ser Pro Arg 

cag cca agg ggt 1300 
Gin Pro Arg Gly 
425 

gat get eta gat 1348 
Asp Ala Leu Asp 
440 

tct tct ccc cca 1396 
Ser Ser Pro Pro 
455 

aga agt cgc age 1444 

Arg Ser Arg Ser 

470 

ttg cca cat tec 14 92 

Leu Pro His Ser 

gat cca cgt get 1540 
Asp Pro Arg Ala 
505 

gat get ggc ttc 1588 
Asp Ala Gly Phe 
520 

gaa gag get tta 1636 
Glu Glu Ala Leu 
535 

agg gag gaa gaa 1684 

Arg Glu Glu Glu 

550 

ccg cct tac tct 1732 
Pro Pro Tyr Ser 



WO 01/21647 PCT/IBOO/01470 

48 

gag act gac teg cag gee teg agg gag egg agg atg aaa aag aat ttg 1780 

Glu Thr Asp Ser Gin Ala Ser Arg Glu Arg Arg Met Lys Lys Asn Leu 

570 575 580 585 

gee ctg agt egg gaa agt tta gtc gtc tga tcccacgttt tgttatgtag 1830 

Ala Leu Ser Arg Glu Ser Leu Val Val * 

590 595 
cttttatact tttttaattg gaatattgat gaaactcttc accaagccta ataaaa 1886 

<210> 15 

<211> 1829 

<212> DNA 

<213> Mus musculus 

<400> 15 



gcaccgtcgc tgetagaegg ccgcg 


atg 


gcg 


ccg 


gcg 


gec 


age 


gcg 


tgt 


get 


52 
















Met 


Ala 


Pro 


Ala 


Ala 
5 


Ser 


Ala 


Cys 


Ala 




999 


gcg 


cct 


ggc 


tec 


cac 


ccg 


1 

gee 


acc 


acg 


ate 


ttc 


gtg 


tgt 


ctt 


ttt 


100 


Gly 


Ala 


Pro 


Gly 


Ser 


His 


Pro 


Ala 


Thr 


Thr 


He 


Phe 


Val 


Cys 


Leu 


Phe 




10 










15 










20 










25 




etc 


ate 


att 


tac 


tgc 


cca 


gac 


cgt 


gee 


agt 


gee 


ate 


cag 


gtg 


acc 


gtg 


148 


Leu 


He 


He 


Tyr 


Cys 


Pro 


Asp 


Arg 


Ala 


Ser 


Ala 


lie 


Gin 


Val 


Thr 


Val 












30 










35 










40 






cct 


gac 


ccc 


tac 


cac 


gta 


gtg 


ate 


ctg 


ttc 


cag 


cca 


gtg 


aca 


eta 


cac 


196 


Pro 


Asp 


Pro 


Tyr 


His 


Val 


Val 


He 


Leu 


Phe 


Gin 


Pro 


Val 


Thr 


Leu 


His 








45 










50 










55 








tgc 


acc 


tac 


cag 


atg 


age 


aat 


acc 


etc 


aca 


gee 


cct 


ate 


gtg 


ate 


tgg 


244 


Cys 


Thr 


Tyr 


Gin 


Met 


Ser 


Asn 


Thr 


Leu 


Thr 


Ala 


Pro 


He 


Val 


He 


Trp 








60 










65 










70 










aag 


tat 


aag 


teg 


ttc 


tgt 


egg 


gac 


cgt 


gtt 


gee 


gac 


gec 


ttc 


tec 


cct 


292 


Lys 


Tyr 


Lys 


Ser 


Phe 


Cys 


Arg 


Asp 


Arg 


Val 


Ala 


Asp 


Ala 


Phe 


Ser 


Pro 






75 










80 










85 












gee 


age 


gtg 


gac 


aac 


cag 


etc 


aac 


gee 


cag 


ctg 


gcg 


get 


ggc 


aac 


ccc 


340 


Ala 


Ser 


Val 


Asp 


Asn 


Gin 


Leu 


Asn 


Ala 


Gin 


Leu 


Ala 


Ala 


Gly 


Asn 


Pro 




90 










95 










100 










105 




ggc 


tac 


aac 


ccc 


tat 


gtg 


gag 


tgc 


cag 


gac 


age 


gta 


cgc 


act 


gtc 


agg 


388 


Gly 


Tyr 


Asn 


Pro 


Tyr 


Val 


Glu 


Cys 


Gin 


Asp 


Ser 


Val 


Arg 


Thr 


Val 


Arg 












110 










115 










120 






gtg 


gtg 


gec 


acc 


aaa 


cag 


ggc 


aat 


get 


gtg 


acc 


ctg 


gga 


gac 


tac 


tac 


436 


Val 


Val 


Ala 


Thr 


Lys 


Gin 


Gly 


Asn 


Ala 


Val 


Thr 


Leu 


Gly 


Asp 


Tyr 


Tyr 










125 










130 










135 








cag 


ggc 


agg 


aga 


ate 


acc 


ate 


aca 


gga 


aat 


get 


ggc 


ctg 


acc 


ttc 


gag 


484 


Gin 


Gly 


Arg 


Arg 


He 


Thr 


He 


Thr 


Gly 


Asn 


Ala 


Gly 


Leu 


Thr 


Phe 


Glu 








140 










145 










150 










cag 


a eg 


gee 


tgg 


gga 


gac 


agt 


gga 


gtg 


tat 


tac 


tgc 


tec 


gtg 


gtc 


tea 


532 


Gin 


Thr 


Ala 


Trp 


Gly 


Asp 


Ser 


Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 


Val 


Val 


Ser 






155 










160 










165 












gee 


caa 


gat 


ctg 


gat 


ggg 


aac 


aac 


gag 


gcg 


tac 


gca 


gag 


etc 


att 


gtc 


580 


Ala 


Gin 


Asp 


Leu 


Asp 


Gly 


Asn 


Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 


Leu 


He 


Val 




170 










175 










180 










185 




ctt 


gat 


tgg 


etc 


ttt 


gtg 


gtc 


gtg 


gtc 


tgc 


ctg 


gca 


age 


etc 


etc 


ttc 


628 


Leu 


Asp 


Trp 


Leu 


Phe 


Val 


Val 


Val 


Val 


Cys 


Leu 


Ala 


Ser 


Leu 


Leu 


Phe 












190 










195 










200 






ttc 


etc 


etc 


ctg 


ggc 


ate 


tgc 


tgg 


tgc 


cag 


tgc 


tgt 


ccc 


cac 


acc 


tgc 


676 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Cys 


Trp 


Cys 


Gin 


Cys 


Cys 


Pro 


His 


Thr 


Cys 










205 










210 










215 








tgc 


tgc 


tat 


gtc 


aga 


tgt 


ccc 


tgc 


tgc 


cca 


gac 


aag 


tgc 


tgt 


tgc 


cct 


724 


Cys 


Cys 


Tyr 


Val 


Arg 


Cys 


Pro 


Cys 


Cys 


Pro 


Asp 


Lys 


Cys 


Cys 


Cys 


Pro 








220 










225 










230 










gag 


gec 


ctt 


tat 


get 


get 


ggc 


aaa 


gca 


gee 


acc 


tea 


ggt 


gtg 


cca 


age 


772 


Glu 


Ala 


Leu 


Tyr 


Ala 


Ala 


Gly 


Lys 


Ala 


Ala 


Thr 


Ser 


Gly 


Val 


Pro 


Ser 






235 










240 










245 












ate 


tat 


gec 


ccc 


age 


ate 


tat 


acc 


cac 


etc 


tct 


cct 


gee 


aag 


act 


ccg 


820 



WO 01/21647 



PCT/IB00/01470 



49 



lie 


Tyr 


Ala 


Pro 


Ser 


He 


Tyr 


Thr 


HIS 


Leu 


Ser 


Pro 


Ala 


Lys 


Thr 


Pro 




250 










255 










260 










265 




cca 


cct 


ccg 


cct 


gee 


atg 


att 


ccc 


atg 


cgt 


cct 


ccc 


tat 


ggg 


tac 


cct 


868 


Pro 


Pro 


Pro 


Pro 


Ala 


Met 


He 


Pro 


Met 


Arg 


Pro 


Pro 


Tyr 


Gly 


Tyr 


Pro 












270 










275 










280 






gga 


gac 


ttt 


gac 


agg 


acc 


age 


tea 


gtt 


ggt 


ggc 


cac 


age 


tec 


cag 


gtg 


916 


Gly 


Asp 


Phe 


Asp 


Arg 


Thr 


Ser 


Ser 


Val 


Gly 


Gly 


His 


Ser 


Ser 


Gin 


Val 










285 










290 










295 








ccc 


ctg 


ctg 


cgt 


gaa 


gtg 


gat 


ggg 


age 


gta 


tct 


tea 


gaa 


gta 


cga 


agt 


964 


Pro 


Leu 


Leu 


Arg 


Glu 


Val 


Asp 


Gly 


Ser 


Val 


Ser 


Ser 


Glu 


Val 


Arg 


Ser 








300 










305 










310 










9gc 


tac 


agg 


ate 


cag 


get 


aac 


cag 


caa 


gat 


gac 


tec 


atg 


agg 


gtc 


eta 


1012 


Gly 


Tyr 


Arg 


He 


Gin 


Ala 


Asn 


Gin 


Gin 


Asp 


Asp 


Ser 


Met 


Arg 


Val 


Leu 






315 










320 










325 












tac 


tat 


atg 


gag 


aag 


gag 


eta 


gee 


aac 


ttc 


gat 


cct 


tec 


egg 


cct 


ggc 


1060 


Tyr 


Tyr 


Met 


Glu 


Lys 


Glu 


Leu 


Ala 


Asn 


Phe 


Asp 


Pro 


Ser 


Arg 


Pro 


Gly 




330 










335 










340 










345 




cct 


ccc 


aat 


ggc 


cga 


gtg 


gaa 


egg 


gee 


atg 


agt 


gaa 


gta 


acc 


tec 


etc 


1108 


Pro 


Pro 


Asn 


Gly 


Arg 


Val 


Glu 


Arg 


Ala 


Met 


Ser 


Glu 


Val 


Thr 


Ser 


Leu 












350 










355 










360 






cat 


gaa 


gat 


gac 


tgg 


cga 


tct 


egg 


cct 


tec 


agg 


get 


cct 


gec 


etc 


aca 


1156 


His 


Glu 


Asp 


Asp 


Trp 


Arg 


Ser 


Arg 


Pro 


Ser 


Arg 


Ala 


Pro 


Ala 


Leu 


Thr 










365 










370 










375 








ccc 


ate 


agg 


gat 


gag 


gag 


tgg 


aat 


cgc 


cac 


tec 


cct 


egg 


agt 


ccc 


aga 


1204 


Pro 


He 


Arg 


Asp 


Glu 


Glu 


Trp 


Asn 


Arg 


His 


Ser 


Pro 


Arg 


Ser 


Pro 


Arg 








380 










385 










390 










aca 


tgg 


gag 


cag 


gaa 


ccc 


ctt 


caa 


gaa 


cag 


cca 


agg 


ggt 


ggt 


tgg 


ggg 


1252 


Thr 


Trp 


Glu 


Gin 


Glu 


Pro 


Leu 


Gin 


Glu 


Gin 


Pro 


Arg 


Gly 


Gly 


Trp 


Gly 






395 










400 










405 












tct 


ggg 


egg 


cct 


egg 


gee 


cgc 


tct 


gtg 


gat 


get 


eta 


gat 


gac 


ate 


aac 


1300 


Ser 


Gly 


Arg 


Pro 


Arg 


Ala 


Arg 


Ser 


Val 


Asp 


Ala 


Leu 


Asp 


Asp 


He 


Asn 




410 










415 










420 










425 




egg 


cct 


ggc 


tec 


act 


gaa 


tea 


gga 


agg 


tct 


tct 


ccc 


cca 


agt 


agt 


gga 


1348 


Arg 


Pro 


Gly 


Ser 


Thr 


GlU 


Ser 


Gly 


Arg 


Ser 


Ser 


Pro 


Pro 


Ser 


Ser 


Gly 












430 










435 










440 






egg 


aga 


ggg 


egg 


gee 


tat 


gca 


cct 


ccg 


aga 


agt 


cgc 


age 


egg 


gat 


gac 


1396 


Arg 


Arg 


Gly 


Arg 


Ala 


Tyr 


Ala 


Pro 


Pro 


Arg 


Ser 


Arg 


Ser 


Arg 


Asp 


Asp 










445 










450 










455 








etc 


tat 


gac 


ccc 


gac 


gat 


cct 


aga 


gac 


ttg 


cca 


cat 


tec 


cga 


gat 


ccc 


1444 


Leu 


Tyr 


Asp 


Pro 


Asp 


Asp 


Pro 


Arg 


Asp 


Leu 


Pro 


His 


Ser 


Arg 


Asp 


Pro 








460 










465 










470 










cac 


tat 


tat 


gat 


gat 


ttg 


agg 


tct 


agg 


gat 


cca 


cgt 


get 


gac 


ccc 


aga 


1492 


His 


Tyr 


Tyr 


Asp 


Asp 


Leu 


Arg 


Ser 


Arg 


Asp 


Pro 


Arg 


Ala 


Asp 


Pro 


Arg 






475 










480 










485 












tec 


cgt 


cag 


cga 


tec 


cac 


gat 


cct 


egg 


gat 


get 


ggc 


ttc 


agg 


tea 


egg 


1540 


Ser 


Arg 


Gin 


Arg 


Ser 


His 


Asp 


Pro 


Arg 


Asp 


Ala 


Gly 


Phe 


Arg 


Ser 


Arg 




490 










495 










500 










505 




gac 


cct 


cag 


tat 


gat 


ggg 


cga 


etc 


tta 


gaa 


gag 


get 


tta 


aag 


aaa 


aaa 


1588 


Asp 


Pro 


Gin 


Tyr 


Asp 


Gly 


Arg 


Leu 


Leu 


Glu 


Glu 


Ala 


Leu 


Lys 


Lys 


Lys 












510 










515 










520 






ggg 


get 


ggg 


gag 


aga 


aga 


cgc 


gtt 


tac 


agg 


gag 


gaa 


gaa 


gaa 


gaa 


gaa 


1636 


Gly 


Ala 


Gly 


Glu 


Arg 


Arg 


Arg 


Val 


Tyr 


Arg 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 










525 










530 










535 








gag 


gag 


ggc 


cac 


tat 


ccc 


cca 


gca 


cct 


ccg 


cct 


tac 


tct 


gag 


act 


gac 


1684 


Glu 


Glu 


Gly 


His 


Tyr 


Pro 


Pro 


Ala 


Pro 


Pro 


Pro 


Tyr 


Ser 


Glu 


Thr 


Asp 








540 










545 










550 










teg 


cag 


gee 


teg 


agg 


gag 


egg 


agg 


atg 


aaa 


aag 


aat 


ttg 


gee 


ctg 


agt 


1732 


Ser 


Gin 


Ala 


Ser 


Arg 


Glu 


Arg 


Arg 


Met 


Lys 


Lys 


Asn 


Leu 


Ala 


Leu 


Ser 






555 










560 










565 












egg 


gaa 


agt 


tta 


gtc 


gtc 


tga 


tcccacgttt tgttatgtag cttttatact 


1783 


Arg 


Glu 


Ser 


Leu 


Val 


Val 


* 























570 575 
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<210> 16 

<211> 1682 

<212> DNA 

<213> Mus musculus 

<400> 16 

gcaccgtcgc tgctagacgg ccgcg atg gcg 

Met Ala 
1 



ggg 


gcg 


cct 


ggc 


tec 


cac 


ccg 


gee 


acc 


Gly 


Ala 


Pro 


Gly 


Ser 


His 


Pro 


Ala 


Thr 


10 










15 








etc 


ate 


att 


tac 


tgc 


cca 


gac 


cgt 


gee 


Leu 


He 


He 


Tyr 


Cys 
30 


Pro 


Asp 


Arg Ala 


cct 


gac 


ccc 


tac 


cac 


gta 


gtg 


ate 


ctg 


Pro 


Asp 


Pro 


Tyr 


His 


val 


val 


He 


Leu 








45 










50 


tgc 


acc 


tac 


cag 


atg 


age 


aat 


acc 


etc 


Cys 


Thr 


Tyr 


Gin 


Met 


Ser 


Asn 


Thr 


Leu 






60 










65 




aag 


tat 


aag 


teg 


ttc 


tgt 


egg 


gac 


cgt 


Lys 


Tyr 


Lys 


Ser 


Phe 


Cys 


Arg 


Asp Arg 




75 










80 






gec 


age 


gtg 


gac 


aac 


cag 


etc 


aac 


gee 


Ala 


Ser 


Val 


Asp 


Asn 


Gin 


Leu 


Asn 


Ala 


90 










95 








ggc 


tac 


aac 


ccc 


tat 


gtg 


gag 


tgc 


cag 


Gly 


Tyr 


Asn 


Pro 


Tyr 


Val 


Glu 


Cys 


Gin 










110 










gtg 


gtg 


gec 


acc 


aaa 


cag 


ggc 


aat 


get 


Val 


Val 


Ala 


Thr 


Lys 


Gin 


Gly 


Asn 


Ala 








125 










130 


cag 


ggc 


agg 


aga 


ate 


acc 


ate 


aca 


gga 


Gin 


Gly 


Arg 


Arg 


He 


Thr 


He 


Thr Gly 






140 










145 




cag 


acg 


gee 


tgg 


gga 


gac 


agt 


gga 


gtg 


Gin 


Thr 


Ala 


Trp 


Gly 


Asp 


Ser 


Gly Val 




155 










160 






gec 


caa 


gat 


ctg 


gat 


ggg 


aac 


aac 


gag 


Ala 


Gin 


Asp 


Leu 


Asp 


Gly 


Asn 


Asn 


Glu 


170 










175 








ctt 


gtt 


tat 


get 


get 


ggc 


aaa 


gca 


gec 


Leu 


Val 


Tyr 


Ala 


Ala 


Gly 


Lys 


Ala 


Ala 










190 










tat 


gec 


ccc 


age 


ate 


tat 


acc 


cac 


etc 


Tyr 


Ala 


Pro 


Ser 


He 


Tyr 


Thr 


His 


Leu 








205 










210 


cct 


ccg 


cct 


gec 


atg 


att 


ccc 


atg 


cgt 


Pro 


Pro 


Pro 


Ala 


Met 


He 


Pro 


Met Arg 






220 










225 




gac 


ttt 


gac 


agg 


acc 


age 


tea 


gtt 


ggt 


Asp 


Phe 


Asp 


Arg 


Thr 


Ser 


Ser 


Val Gly 




235 










240 






ctg 


ctg 


cgt 


gaa 


gtg 


gat 


ggg 


age 


gta 


Leu 


Leu 


Arg 


Glu 


Val 


Asp 


Gly 


Ser 


Val 


250 










255 








tac 


agg 


ate 


cag 


get 


aac 


cag 


caa 


gat 


Tyr 


Arg 


He 


Gin 


Ala 


Asn 


Gin 


Gin Asp 










270 










tat 


atg 


gag 


aag 


gag 


eta 


gec 


aac 


ttc 


Tyr 


Met 


Glu 


Lys 


Glu 


Leu 


Ala 


Asn 


Phe 



ccg 


gcg 


gee 


age 


gcg 


tgt 


get 


52 


Pro 


Ala 


Ala 


Ser 


Ala 


Cys 


Ala 




acg 


ate 


5 

ttc 


gtg 


tgt 


ctt 


ttt 


100 


Thr 


He 


Phe 


Val 


Cys 


Leu 


Phe 






20 










25 




agt 


gec 


ate 


cag 


gtg 


acc 


gtg 


148 


Ser 


Ala 


He 


Gin 


Val 


Thr 


Val 




35 










40 






ttc 


cag 


cca 


gtg 


aca 


eta 


cac 


196 


Phe 


Gin 


Pro 


Val 


Thr 


Leu 


His 












55 








aca 


gee 


cct 


ate 


gtg 


ate 


tgg 


244 


Thr 


Ala 


Pro 


He 


Val 


He 


Trp 










70 










gtt 


gec 


gac 


gee 


ttc 


tec 


cct 


292 


Val Ala Asp 


Ala 


Phe 


Ser 


Pro 








85 












cag 


ctg 


gcg 


get 


ggc 


aac 


ccc 


340 


Gin 


Leu 


Ala 


Ala Gly Asn 


Pro 






100 










105 




gac 


age 


gta 


cgc 


act 


gtc 


agg 


388 


Asp 


Ser 


Val 


Arg 


Thr 


Val 


Arg 




115 










120 






gtg 


acc 


ctg 


gga 


gac 


tac 


tac 


436 


val 


Thr 


Leu 


Gly Asp Tyr 


Tyr 












135 








aat 


get 


ggc 


ctg 


acc 


ttc 


gag 


484 


Asn Ala Gly 


Leu 


Thr 


Phe 


Glu 










150 










tat 


tac 


tgc 


tec 


gtg 


gtc 


tea 


532 


Tyr Tyr Cys 


Ser 


Val 


Val 


Ser 








165 












gcg 


tac 


gca 


gag 


etc 


att 


gtc 


580 


Ala Tyr Ala 


Glu 


Leu 


He 


Val 






180 










185 




acc 


tea 


ggt 


gtg 


cca 


age 


ate 


628 


Thr Ser Gly 


Val 


Pro 


Ser 


He 




195 










200 






tct 


cct 


gee 


aag 


act 


ccg 


cca 


676 


Ser 


Pro 


Ala 


Lys 


Thr 


Pro 


Pro 












215 








cct 


ccc 


tat 


ggg 


tac 


cct 


gga 


724 


Pro 


Pro Tyr 


Gly 


Tyr 


Pro 


Gly 










230 










ggc 


cac 


age 


tec 


cag 


gtg 


ccc 


772 


Gly His Ser 


Ser 


Gin 


Val 


Pro 








245 












tct 


tea 


gaa 


gta 


cga 


agt 


ggc 


820 


Ser 


Ser 


Glu 


Val 


Arg 


Ser 


Gly 






260 










265 




gac 


tec 


atg 


agg 


gtc 


eta 


tac 


868 


Asp Ser Met 


Arg 


Val 


Leu 


Tyr 




275 










280 






gat 


cct 


tec 


egg 


cct 


ggc 


cct 


916 


Asp 


Pro 


Ser 


Arg 


Pro 


Gly 


Pro 
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285 290 295 



ccc aat 


ggc 


cga 


gtg 


gaa 


egg 


gee 


atg 


agt 


gaa 


gta 


ace 


tec 


_ _ 
etc 


cat 




Pro Asn 


Gly 
300 


Arg 


Val 


Glu 


Arg 


Ala 
305 


Met 


Ser 


Glu 


Val 


Thr 

Tin 
310 


O A <V- 

ser 


T nu 

Leu 


HIS 




gaa gat 


gac 


tgg 


cga 


tct 


egg 


cct 


tec 


agg 


get 


cct 


gee 


etc 


aca 


CCC 


1012 


Glu Asp 


Asp 


Trp 


Arg 


Ser 


Arg 


Pro 


Ser 


Arg 


Ala 


Pro 


Ala 


Leu 


Thr 


Pro 




315 










320 










325 












ate agg 


gat 


gag 


gag 


tgg 


aat 


cgc 


cac 


tec 


cct 


egg 


agt 


ccc 


aga 


aca 


1060 


lie Arg 


Asp 


Glu 


Glu 


Trp 


Asn 


Arg 


His 


Ser 


Pro 


Arg 


Ser 


Pro 


Arg 


Thr 




330 








335 










340 










345 




tgg gag 


cag 


gaa 


ccc 


ctt 


caa 


gaa 


cag 


cca 


agg 


ggt 


ggt 


tgg 


ggg 


tct 


1108 


Trp Glu 


Gin 


Glu 


Pro 


Leu 


Gin 


Glu 


Gin 


Pro 


Arg 


Gly 


Gly 


Trp 


Gly Ser 










350 










355 










360 






ggg egg 


cct 


egg 


gec 


cgc 


tct 


gtg 


gat 


get 


eta 


gat 


gac 


ate 


aac 


egg 


1156 


Gly Arg 


Pro 


Arg 


Ala 


Arg 


Ser 


Val 


Asp 


Ala 


Leu 


Asp 


Asp 


He 


Asn Arg 








365 










370 










375 








cct ggc 


tec 


act 


gaa 


tea 


gga 


agg 


tct 


tct 


ccc 


cca 


agt 


agt 


gga 


egg 


1204 


Pro Gly 


Ser 


Thr 


Glu 


Ser 


Gly 


Arg 


Ser 


Ser 


Pro 


Pro 


Ser 


Ser 


Gly Arg 






380 










385 










390 










aga ggg 


egg 


gee 


tat 


gca 


cct 


ccg 


aga 


agt 


cgc 


age 


egg 


gat 


gac 


etc 


1252 


Arg Gly 


Arg 


Ala 


Tyr 


Ala 


Pro 


Pro 


Arg 


Ser 


Arg 


Ser 


Arg 


Asp 


Asp 


Leu 




395 










400 










405 












tat gac 


ccc 


gac 


gat 


cct 


aga 


gac 


ttg 


cca 


cat 


tec 


cga 


gat 


CCC 


cac 


1300 


Tyr Asp 


Pro 


Asp 


Asp 


Pro 


Arg 


Asp 


Leu 


Pro 


His 


Ser 


Arg 


Asp 


Pro 


His 




410 








415 










420 










425 




tat tat 


gat 


gat 


ttg 


agg 


tct 


agg 


gat 


cca 


cgt 


get 


gac 


ccc 


aga 


tec 


1348 


Tyr Tyr 


Asp 


Asp 


Leu 
430 


Arg 


Ser 


Arg 


Asp 


Pro 
435 


Arg 


Ala 


Asp 


Pro 


Arg 
440 


Ser 




cgt cag 


cga 


tec 


cac 


gat 


cct 


egg 


gat 


get 


ggc 


ttc 


agg 


tea 


egg 


gac 


1396 


Arg Gin 


Arg 


Ser 


His 


Asp 


Pro 


Arg 


Asp 


Ala 


Gly 


Phe 


Arg 


Ser 


Arg Asp 








445 










450 










455 








cct cag 


tat 


gat 


ggg 


cga 


etc 


tta 


gaa 


gag 


get 


tta 


aag 


aaa 


aaa 


ggg 


1444 


Pro Gin 


Tyr 
460 


Asp 


Gly 


Arg 


Leu 


Leu 
465 


Glu 


Glu 


Ala 


Leu 


Lys 
470 


Lys 


Lys 


Gly 




get ggg 


gag 


aga 


aga 


cgc 


gtt 


tac 


agg 


gag 


gaa 


gaa 


gaa 


gaa 


gaa 


gag 


1492 


Ala Gly 


Glu 


Arg 


Arg 


Arg 


Val 


Tyr 


Arg 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 




475 










480 










485 












gag ggc 


cac 


tat 


ccc 


cca 


gca 


cct 


ccg 


cct 


tac 


tct 


gag 


act 


gac 


teg 


1540 


Glu Gly 


His 


Tyr 


Pro 


Pro 


Ala 


Pro 


Pro 


Pro 


Tyr 


Ser 


Glu 


Thr 


Asp 


Ser 




490 








495 










500 










505 




cag gee 


teg 


agg 


gag 


egg 


agg 


atg 


aaa 


aag 


aat 


ttg 


gee 


ctg 


agt 


egg 


1588 


Gin Ala 


Ser 


Arg 


Glu 


Arg 


Arg 


Met 


Lys 


Lys 


Asn 


Leu 


Ala 


Leu 


Ser Arg 










510 










515 










520 






gaa agt 


tta 


gtc 


gtc 


tga 


tcccacgttt tgttatgtag cttttatact 




1636 


Glu Ser 


Leu 


Val 
525 


Val 


* 

























tttttaattg gaatattgat gaaactcttc accaagccta ataaaa 1682 



<210> 17 

<211> 594 

<212> PRT 

<213> MUS 



musculus 



<400> 17 

Met Ala Pro Ala Ala Ser Ala Cys 
1 5 
Ala Thr Thr He Phe Val Cys Leu 
20 

Arg Ala Ser Ala He Gin Val Thr 

35 40 
He Leu Phe Gin Pro Val Thr Leu 

50 55 
Thr Leu Thr Ala Pro He Val He 



Ala Gly Ala Pro Gly Ser His Pro 

10 15 
Phe Leu He He Tyr Cys Pro Asp 
25 30 
Val Pro Asp Pro Tyr His Val Val 
45 

His Cys Thr Tyr Gin Met Ser Asn 
60 

Trp Lys Tyr Lys Ser Phe Cys Arg 
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65 










70 


Asp 


Arg 


Val 


Ala 


Asp 


Ala 










85 




Asn 


Ala 


Gin 


Leu 


Ala 


Ala 








100 






Cys 


Gin 


Asp 


Ser 


Val 


Arg 






115 








Asn 


Ala 


Val 


Thr 


Leu 


Gly 




130 










Thr Gly 


Asn 


Ala 


Gly 


Leu 


145 










150 


Gly Val 


Tyr 


Tyr 


Cys 


Ser 










165 




Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 








180 






Pro 


Glu 


Leu 


Leu 


Pro 


Gly 






195 








Phe 


Val 


Val 


Val 


Val 


Cys 




210 










Gly He 


Cys 


Trp 


Cys 


Gin 


225 










230 


Arg Cys 


Pro 


Cys 


Cys 


Pro 










245 




Ala 


Ala 


Gly 


Lys 


Ala 


Ala 








260 






Ser 


He 


Tyr 


Thr 


His 


Leu 






275 








Ala 


Met 


He 


Pro 


Met 


Arg 




290 










Arg 


Thr 


Ser 


Ser 


Val 


Gly 


305 










310 


Glu 


Val 


Asp 


Gly 


Ser 


Val 










325 




Gin 


Ala 


Asn 


Gin 


Gin 


Asp 








340 






Lys 


Glu 


Leu 


Ala 


Asn 


Phe 






355 








Arg Val 


Glu 


Arg 


Ala 


Met 




370 










Trp 


Arg 


Ser 


Arg 


Pro 


Ser 


385 










390 


Glu 


Glu 


Trp 


Asn 


Arg 


His 










405 




Glu 


Pro 


Leu 


Gin 


Glu 


Gin 








420 






Arg 


Ala 


Arg 


Ser 


Val 


Asp 






435 








Thr 


Glu 


Ser 


Gly 


Arg 


Ser 




450 










Ala 


Tyr 


Ala 


Pro 


Pro 


Arg 


465 










470 


Asp 


Asp 


Pro 


Arg 


Asp 


Leu 










485 




Asp 


Leu 


Arg 


Ser 


Arg 


Asp 








500 






Ser 


His 


Asp 


Pro 


Arg 


Asp 






515 








Asp 


Gly 


Arg 


Leu 


Leu 


Glu 




530 










Arg Arg 


Arg 


Val 


Tyr 


Arg 


545 










550 


Tyr 


Pro 


Pro 


Ala 


Pro 


Pro 



565 



52 

75 



Phe 


Ser 


Pro 


Ala 


Ser 


Val 








90 






Gly 


Asn 


Pro 


Gly 


Tyr 


Asn 






105 








Thr 


Val 


Arg 


Val 


Val 


Ala 




120 










Asp 


Tyr 


Tyr 


Gin 


Gly 


Arg 


135 










140 


Thr 


Phe 


Glu 


Gin 


Thr 


Ala 










155 




Val 


Val 


Ser 


Ala 


Gin 


Asp 








170 






Leu 


He 


Val 


Leu 


Gly 


Arg 






185 








Phe 


Arg 


Ala 


Gly 


Pro 


Leu 




200 










Leu 


Ala 


Ser 


Leu 


Leu 


Phe 


215 










220 


Cys 


Cys 


Pro 


His 


Thr 


Cys 










235 




Asp 


Lys 


Cys 


Cys 


Cys 


Pro 








250 






Thr 


Ser 


Gly 


Val 


Pro 


Ser 






265 








Ser 


Pro 


Ala 


Lys 


Thr 


Pro 




280 










Pro 


Pro 


Tyr 


Gly 


Tyr 


Pro 


295 










300 


Gly 


His 


Ser 


Ser 


Gin 


Val 










315 




Ser 


Ser 


Glu 


Val 


Arg 


Ser 








330 






Asp 


Ser 


Met 


Arg 


Val 


Leu 






345 








Asp 


Pro 


Ser 


Arg 


Pro 


Gly 




360 










Ser 


Glu 


Val 


Thr 


Ser 


Leu 


375 










380 


Arg 


Ala 


Pro 


Ala 


Leu 


Thr 










395 




Ser 


Pro 


Arg 


Ser 


Pro 


Arg 








410 






Pro 


Arg 


Gly 


Gly 


Trp 


Gly 






425 








Ala 


Leu 


Asp 


Asp 


He 


Asn 




440 










Ser 


Pro 


Pro 


Ser 


Ser 


Gly 


455 










460 


Ser 


Arg 


Ser 


Arg 


Asp 


Asp 










475 




Pro 


His 


Ser 


Arg 


Asp 


Pro 








490 






Pro 


Arg 


Ala 


Asp 


Pro 


Arg 






505 








Ala 


Gly 


Phe 


Arg 


Ser 


Arg 




520 










Glu 


Ala 


Leu 


Lys 


Lys 


Lys 


535 










540 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 










555 




Pro 


Tyr 


Ser 


Glu 


Thr 


Asp 



570 
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80 



Asp 


Asn 


Gin 


Leu 






95 




Pro 


Tyr 


Val 


Glu 




110 






Thr 


Lys 


Gin 


Gly 


125 








Arg 


He 


Thr 


He 


Trp 


Gly 


Asp 


Ser 








160 


Leu 


Asp 


Gly 


Asn 






175 




Thr 


Ser 


Glu 


Ala 




190 






Glu 


Asp 


Trp 


Leu 


205 








Phe 


Leu 


Leu 


Leu 


Cys 


Cys 


Tyr 


Val 








240 


Glu 


Ala 


Leu 


Tyr 






255 




He 


Tyr 


Ala 


Pro 




270 






Pro 


Pro 


Pro 


Pro 


285 








Gly 


Asp 


Phe 


Asp 


Pro 


Leu 


Leu 


Arg 








320 


Gly 


Tyr 


Arg 


lie 






335 




Tyr 


Tyr 


Met 


Glu 




350 






Pro 


Pro 


Asn 


Gly 


365 








His 


Glu 


Asp 


Asp 


Pro 


He 


Arg 


Asp 








400 


Thr 


Trp 


Glu 


Gin 






415 




Ser 


Gly 


Arg 


Pro 




430 






Arg 


Pro 


Gly 


Ser 


445 








Arg 


Arg 


Gly 


Arg 


Leu 


Tyr 


Asp 


Pro 








480 


His 


Tyr 


Tyr 


Asp 






495 




Ser 


Arg 


Gin 


Arg 




510 






Asp 


Pro 


Gin 


Tyr 


525 








Gly 


Ala 


Gly 


Glu 


Glu 


Glu 


Gly 


His 








560 


Ser 


Gin 


Ala 


Ser 






575 
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Arg Glu Arg Arg Met Lys 
580 

Val Val 

<210> 18 
<211> 575 
<212> PRT 

<213> Mus musculus 



<400> 18 



Met 


Ala 


Pro 


Ala 


Ala 


Ser 


1 








5 




Ala 


Thr 


Thr 


lie 


Phe 


Val 








20 






Arg 


Ala 


Ser 


Ala 


He 


Gin 






35 








lie 


Leu 


Phe 


Gin 


Pro 


Val 




50 










Thr 


Leu 


Thr 


Ala 


Pro 


He 


65 










70 


Asp 


Arg 


Val 


Ala 


Asp Ala 










85 




Asn 


Ala 


Gin 


Leu 


Ala 


Ala 








100 






Cys 


Gin 


Asp 


Ser 


Val 


Arg 






115 








Asn 


Ala 


Val 


Thr 


Leu Gly 




130 










Thr 


Gly 


Asn 


Ala 


Gly Leu 


145 










150 


Gly 


Val 


Tyr 


Tyr 


Cys 


Ser 










165 




Asn 


Glu 


Ala 


Tyr 


Ala 


Glu 








180 






Val 


Val 


Cys 


Leu 


Ala 


Ser 






195 








Trp 


Cys 


Gin 


Cys 


Cys 


Pro 




210 










Cys 


Cys 


Pro 


Asp 


Lys 


Cys 


225 










230 


Lys 


Ala 


Ala 


Thr 


Ser Gly 










245 




Thr 


His 


Leu 


Ser 


Pro 


Ala 








260 






Pro 


Met 


Arg 


Pro 


Pro 


Tyr 
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75 










80 


lie Tyr Gin 


Gin 


lie 


Leu 


He 


Asn 


Leu 


Pro 


Ser 


Arg 


Asn 


Val 


He 


Gin 






85 










90 










95 




He Ser Asn Asp 


Leu 


C*\ ti 
. V7-LU 


Asn 


Leu 


Arg 


Asp 


Leu 


Leu 


His 


Leu 


Leu 


Ala 




100 










105 










110 






Phe Ser Lys 


Ser 


Cys 


Hi q 
ni e> 


Leu 


Pro 


Leu 


Ala 


Ser 


Gly 


Leu 


Glu 


Thr 


Leu 


115 










120 










125 








Glu Ser Leu Gly 


Asp 


Val 


Leu 


Glu 


Ala 


Ser 


Leu 


Tyr 


Ser 


Thr 


Glu 


Val 


130 








135 










140 










Val Ala Leu 


Ser 


Arg 


Leu 


Gin 


Gly 


Ser 


Leu 


Gin 


Asp 


Met 


Leu 


Trp 


Gin 


145 






150 










155 










160 


Leu Asp Leu 


Ser 


Pro 


Glv 


Cys 
























165 
























<210> 34 




























<211> 167 




























<212> PRT 




























<213> Mus musculus 
























<400> 34 




























Met Cys Trp 


Arg 


Pro 


Leu 


Cys 


Arg 


Phe 


Leu 


Trp 


Leu 


Trp 


Ser 


Tyr 


Leu 


1 




5 










10 










15 




Ser Tyr Val 


Gin 


Ala 


Val 


Pro 


He 


Gin 


Lys 


Val 


Gin 


Asp 


Asp 


Thr 


Lys 




20 










25 










30 






Thr Leu He 


Lys 


Thr 


He 


Val 


Thr 


Arg 


He 


Asn 


Asp 


He 


Ser 


His 


Thr 


35 










40 










45 








Gin Ser Val 


Ser 


Ala 


Lys 


Gin 


Arg 


Val 


Thr 


Gly 


Leu 


Asp 


Phe 


He 


Pro 


50 








55 










60 










Gly Leu His 


Pro 


He 


Leu 


Ser 


Leu 


Ser 


Lys 


Met 


Asp 


Gin 


Thr 


Leu 


Ala 


65 






70 










75 










80 


Val Tyr Gin 


Gin 


Val 


Leu 


Thr 


Ser 


Leu 


Pro 


Ser 


Gin 


Asn 


Val 


Leu 


Gin 






85 










90 










95 




He Ala Asn 


Asp 


Leu 


Glu 


Asn 


Leu 


Arg 


Asp 


Leu 


Leu 


His 


Leu 


Leu 


Ala 




100 










105 










110 






Phe Ser Lys 


Ser 


Cys 


Ser 


Leu 


Pro 


Gin 


Thr 


Ser 


Gly 


Leu 


Gin 


Lys 


Pro 


115 










120 










125 








Glu Ser Leu 


Asp 


Gly 


Val 


Leu 


Glu 


Ala 


Ser 


Leu 


Tyr 


Ser 


Thr 


Glu 


Val 


130 








135 










140 
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60 



Val Ala Leu Ser Arg Leu 


Gin 


Gly 


Ser 


Leu 


Gin 


Asp 


He 


Leu 


Gin 


Gin 


145 


150 










155 










160 


Leu Asp Val Ser Pro Glu 


Cys 






















165 






















<210> 35 






















<211> 146 






















<212> PRT 






















<213> Ovus aries 






















<400> 35 






















Val Pro 


lie Arg Lys Val 


Gin 


Asp 


Asp 


Thr 


Lys 


Thr 


Leu 


He 


Lys 


Thr 


1 


5 








10 










15 




lie Val 


Thr Arg lie Asn 


Asp 


He 


Ser 


His 


Thr 


Gin 


Ser 


val 


Ser 


Ser 




20 






25 










30 






Lys Gin 


Arg Val Thr Gly 


Leu 


Asp 


Phe 


He 


Pro 


Gly 


Leu 


His 


Pro 


Leu 




35 




40 










45 








Leu Ser 


Leu Ser Lys Met 


Asp 


Gin 


Thr 


Leu 


Ala 


He 


Tyr 


Gin 


Gin 


He 


50 




55 










60 










Leu Ala 


Ser Leu Pro Ser 


Arg 


Asn 


val 


He 


Gin 


He 


Ser 


Asn 


Asp 


Leu 


65 


70 










75 










80 


Glu Asn 


Leu Arg Asp Leu 


Leu 


His 


Leu 


Leu 


Ala 


Ala 


Ser 


Lys 


Ser 


Cys 




85 








90 










95 




Pro Leu 


Pro Gin Val Arg 


Ala 


Leu 


Glu 


Ser 


Leu 


Glu 


Ser 


Leu 


Gly 


Val 




100 






105 










110 






Val Leu 


Glu Ala Ser Leu 


Tyr 


Ser 


Thr 


Glu 


Val 


Val 


Ala 


Leu 


Ser 


Arg 




115 




120 










125 








Leu Gin 


Gly Ser Leu Gin 


Asp 


Met 


Leu 


Arg 


Gin 


Leu 


Asp 


Leu 


Ser 


Pro 


130 




135 










140 










Gly Cys 
























145 
























<210> 36 






















<211> 146 






















<212> PRT 






















<213> Pan troglodytes 






















<400> 36 






















Val Pro 


lie Gin Lys Val 


Gin 


Asp 


Asp 


Thr 


Lys 


Thr 


Leu 


He 


Lys 


Thr 


1 


5 








10 










15 




lie Val 


Thr Arg lie Asn 


Asp 


He 


Ser 


His 


Thr 


Gin 


Ser 


Val 


Ser 


Ser 




20 






25 










30 






Lys Gin 


Lys Val Thr Gly 


Leu 


Asp 


Phe 


He 


Pro 


Gly 


Leu 


His 


Pro 


He 




35 




40 










45 








Leu Thr 


Leu Ser Lys Met 


Asp 


Gin 


Thr 


Leu 


Ala 


Val 


Tyr 


Gin 


Gin 


He 


50 




55 










60 










Leu Thr 


Ser Met Pro Ser 


Arg 


Asn 


Met 


He 


Gin 


He 


Ser 


Asn 


Asp 


Leu 


65 


70 










75 










80 


Glu Asn 


Leu Arg Asp Leu 


Leu 


His 


Val 


Leu 


Ala 


Phe 


Ser 


Lys 


Ser 


Cys 




85 








90 










95 




His Leu 


Pro Trp Ala Ser 


Gly 


Leu 


Glu 


Thr 


Leu 


Asp 


Ser 


Leu 


Gly 


Gly 




100 






105 










110 






Val Leu 


Glu Ala Ser Gly 


Tyr 


Ser 


Thr 


Glu 


Val 


Val 


Ala 


Leu 


Ser 


Arg 




115 




120 










125 








Leu Gin 


Gly Ser Leu Gin 


Asp 


Met 


Leu 


Trp 


Gin 


Leu 


Asp 


Leu 


Ser 


Pro 


130 




135 










140 











Gly Cys 
145 

<210> 37 
<211> 146 
<212> PRT 

<213> Pongo pygmaeus 
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<400> 37 



Val Pro lie Gin Lys Val 


Gin 


Asp 


Asp 


Thr 


LVS 


Thr 


Leu 


He 


Lvs 


Thr 


1 5 










10 










15 




Val He Thr Arg He Asn 


Asp 


He 


Ser 


His 


Thr 


Gin 


Ser 


Val 


Ser 


Ser 


20 


















30 






Lys Gin Lys Val Thr Gly 






Phe 


He 


Pro 


Glv 


Leu 


His 


Pro 


He 


35 






ft U 










45 








Leu Thr Leu Ser Lys 


Met 


Asp 


VJ J. 11 


Thr 


Leu 


Ala 


Val 


Tvr 


Gin 


Gin 


He 


50 




3D 










fi 0 

O V 










Leu Thr Ser Met Pro 


Ser 


Arg 


Asn 


Val 


11C 


Gin 


He 


Ser 


Asn 


Asp 


Leu 


65 


70 




















R 0 
o y 


Glu Asn Leu Arg Asp 


Leu 


T.pi i 
JJCU 


nio 


Val 

VAX 


Leu 


Ala 


Phe 


Ser 


Lys 


Ser 


Cys 


85 










90 










95 




His Leu Pro Trp Ala Ser 


Gly 


Leu 


Glu 


Thr 


Leu 


Asp 


Arg 


Leu 


Gly 


Gly 


100 


















lift 

llv 






Val Leu Glu Ala Ser Gly 


Tyr 


ser 


inr 




vai 


Val 


Kla 


Leu 


Onr 
wCl 


Arg 


115 






120 










±Z D 








Leu Gin Arg Ser Leu 


Gin 


Asp 


Met 


Leu 


Trp 


Gin 


Leu 


Asp 


Leu 


Ser 


Pro 


130 




135 










140 










Gly Cys 
























145 
























<210> 38 
























<211> 167 
























<212> PRT 
























<213> Rattus norvegicus 






















<4UU> JO 
























Met Cys Trp Arg Pro 


Leu 


Cys 


Arg 


pne 


T All 

jueu 


Trp 


Leu 


Trp 


Cat* 

ser 


Tyr 


Leu 


1 5 










10 










15 




Ser Tyr Val Gin Ala 


Val 


Pro 


He 


t 

His 


Lys 


vai 


Gin 


Asp 


Asp 


inr 


Lys 


20 








25 










^0 






Thr Leu He Lys Thr 


He 


vai 


Tnr 


Arg 


Tl Q 

lie 


Asn 


Asp 


Tip 

lie 




nis 


Thr 
i in 


35 






40 










A C 








Gin Ser Val Ser Ala 


Arg 


/IT „ 


Arg 


vai 


inr 


vi±y 


Leu 


Asp 


DVio 

riie 


11c 


xri U 


50 




55 










c n 










Gly Leu His Pro He 


Leu 


ser 


Lieu 


Ser 


Lys 


ncL 


Asp 


fin 

bin 


inr 


Leu 


Hid 


65 


70 










•"J C 

/5 










o u 


Val Tyr Gin Gin He 


Leu 


Thr 


ser 


Leu 


Pro 


Ser 


i?in 


Asn 


vai 


T All 


ulu 


85 










on 














He Ala His Asp Leu 


Glu 


Asn 


Leu 


Arg 


Asp 


Leu 


Leu 


nio 


Leu 


Leu 


Mid 


100 








105 










110 






Phe Ser Lys Ser Cys 


Ser 


Leu 


Pro 


Gin 


Thr 


Arg 


Gly 


Leu 


Gin 


Lys 


Pro 


115 






ion 










1 oc 

1 ^ D 








Glu Ser Leu Asp Gly 


Val 


Leu 




&l a 


OCI 


T.pi 1 

lieu 


xyr 


C a -y- 


Thr 


VJlU 


VOl 


130 




1 J D 










in w 










Val Ala Leu Ser Arg 


Leu 


Gin 


Gly 


Ser 


Leu 


Gin 


Asp 


He 


Leu 


Gin 


Gin 


145 


150 










155 










160 


Leu Asp Leu Ser Pro 


Glu 


Cys 




















165 
























<210> 39 
























<211> 167 
























<212> PRT 
























<213> Sus scrofa 
























<400> 39 
























Met Arg Cys Gly Pro 


Leu 


Cys 


Arg 


Phe 


Leu 


Trp 


Leu 


Trp 


Pro 


Tyr 


Leu 


1 5 










10 










15 




Ser Tyr Val Glu Ala 


Val 


Pro 


He 


Trp 


Arg 


Val 


Gin 


Asp 


Asp 


Thr 


Lys 


20 








25 










30 






Thr Leu lie Lys Thr 


He 


Val 


Thr 


Arg 


He 


Ser 


Asp 


He 


Ser 


His 


Met 
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35 40 45 

Gin Ser Val Ser Ser Lys Gin Arg Val Thr Gly Leu Asp Phe lie Pro 

50 55 60 

Gly Leu His Pro Val Leu Ser Leu Ser Lys Met Asp Gin Thr Leu Ala 
65 70 75 80 

lie Tyr Gin Gin He Leu Thr Ser Leu Pro Ser Arg Asn Val He Gin 

85 90 95 

He Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Leu Leu Ala 

100 105 110 

Ser Ser Lys Ser Cys Pro Leu Pro Gin Ala Arg Ala Leu Glu Thr Leu 

115 120 125 

Glu Ser Leu Gly Gly Val Leu Glu Ala Ser Leu Tyr Ser Thr Glu Val 

130 135 140 

Val Ala Leu Ser Arg Leu Gin Gly Ala Leu Gin Asp Met Leu Arg Gin 
145 150 155 160 

Leu Asp Leu Ser Pro Gly Cys 
165 



<210> 40 
<211> 4 
<212> PRT 

<213> Homo sapiens 

<400> 40 

Glu Thr Leu Asp 

1 



<210> 41 
<211> 4 
<212> PRT 

<213> Mus musculus 



<400> 41 

Gin Lys Pro Glu 

1 

<210> 42 
<211> 6 
<212> PRT 

<213> Homo sapiens 



<400> 42 

Leu Asp Ser Leu Gly Gly 
1 5 



<210> 43 
<211> 4 
<212> PRT 

<213> Homo sapiens 

<400> 43 

Glu Lys Leu Glu 

1 



<210> 44 
<211> 4 
<212> PRT 

<213> Homo sapiens 

<400> 44 

Glu Lys Pro Glu 

1 
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<210> 45 

<211> 4 

<212> PRT 

<213> Homo sapiens 

<400> 45 

Glu Lys Pro Asp 

1 

<210> 46 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<400> 46 

Thr Pro Asp Ser Leu 
1 5 

<210> 47 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 47 

Gly Leu Gin Thr Leu Asp Ser Leu Gly 
1 5 

<210> 48 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<400> 48 

Gly Gly Val Leu Glu 
1 5 

<210> 49 

<211> 6 

<212> PRT 

<213> Homo sapiens 

<400> 49 

Thr Pro Asp Ser Leu Gly 
1 5 

<210> 50 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Ser Leu Gly Gly Val Leu Glu Ala Ser 
1 5 

<210> 51 

<211> 6 

<212> PRT 

<213> Homo sapiens 



<400> 51 

Pro Glu Ser Leu Gly Gly 
1 5 
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<210> 52 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 52 

Pro Asp Ser Leu Gly Gly 
1 5 

<210> 53 

<211> 7 

<212> PRT 

<213> Homo sapiens 

<400> 53 

Leu Gly Gly Val Leu Glu Ala 
1 5 

<210> 54 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 54 

Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys 
15 10 15 

His Leu Pro Trp Ala Ser 
20 

<210> 55 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 55 

Leu Leu His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Trp Ala 
15 10 15 

Ser Gly Leu Glu Thr Leu 
20 

<210> 56 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 56 

Ala Phe Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr 
15 10 15 

Leu Asp Ser Leu Gly Gly 
20 

<210> 57 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 57 

Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly 
15 10 15 

Gly Val Leu Glu Ala Ser 
20 



<210> 58 
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65 

<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 58 

Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val 
15 10 15 

Leu Glu 

<210> 59 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 59 

Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly Gly Val 
15 10 

<210> 60 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 60 

Ala Ser Gly Leu Glu Thr Asp Ser Leu Gly Gly Val Leu Glu Ala Ser 
15 10 15 

Gly Tyr Ser Thr Glu 
20 

<210> 61 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 61 

Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly 
15 10 

<210> 62 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 62 

Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr 
15 10 15 

Glu Val Val Ala Leu Ser 
20 

<210> 63 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 63 

Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val Ala Leu 
15 10 15 

Ser Arg Gly Gin Gly Ser 
20 



<210> 64 
<211> 22 
<212> PRT 
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<213> Mus musculus 
<400> 64 

Glu Asn Leu Arg Asp Leu Leu His Leu Leu Ala Phe Ser Lys Ser Cys 
15 10 15 

Ser Leu Pro Gin Thr Ser 
20 

<210> 65 
<211> 22 
<212> PRT 

<213> Mus musculus 
<400> 65 

Leu Leu His Leu Leu Ala Phe Ser Lys Ser Cys Ser Leu Pro Gin Thr 
15 10 15 

Ser Gly Leu Gin Lys Pro 
20 

<210> 66 

<211> 22 

<212> PRT 

<213> Mus musculus 

<400> 66 

Ala Phe Ser Lys Ser Cys Ser Leu Pro Gin Thr Ser Gly Leu Gin Lys 
1 5 10 15 

Pro Glu Ser Leu Asp Gly 
20 

<210> 67 

<211> 22 

<212> PRT 

<213> Mus musculus 

<400> 67 

Cys Ser Leu Pro Gin Thr Ser Gly Leu Gin Lys Pro Glu Ser Leu Asp 
15 10 15 

Gly Val Leu Glu Ala Ser 
20 

<210> 68 
<211> 18 
<212> PRT 

<213> Mus musculus 
<400> 68 

Leu Pro Gin Thr Ser Gly Leu Gin Lys Pro Glu Ser Leu Asp Gly Val 
1 5 10 15 

Leu Glu 

<210> 69 
<211> 14 
<212> PRT 

<213> Mus musculus 
<400> 69 

Gin Thr Ser Gly Leu Gin Lys Pro Glu Ser Leu Asp Gly Val 
15 10 



<210> 70 
<211> 22 
<212> PRT 
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<213> Mus musculus 
<400> 70 

Thr Ser Gly Leu Gin Lys Pro Glu Ser Leu Asp Gly Val Leu Glu Ala 
15 10 15 

Ser Leu Tyr Ser Thr Glu 
20 

<210> 71 
<211> 10 
<212> PRT 

<213> Mus musculus 
<400> 71 

Ser Gly Leu Gin Lys Pro Glu Ser Leu Asp 
15 10 

<210> 72 
<211> 22 
<212> PRT 

<213> Mus musculus 
<400> 72 

Lys Pro Glu Ser Leu Asp Gly Val Leu Glu Ala Ser Leu Tyr Ser Thr 
15 10 15 

Glu Val Val Ala Leu Ser 
20 

<210> 73 
<211> 22 
<212> PRT 

<213> Mus musculus 
<400> 73 

Asp Gly Val Leu Glu Ala Ser Leu Tyr Ser Thr Glu Val Val Ala Leu 
15 10 15 

Ser Arg Leu Gin Gly Ser 
20 

<210> 74 
<211> 67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Chimeric oligonucleotides 
<400> 74 

atgcaacagg acggacttgg agtagttttc uacuccaagt cagtccuguu gcaugcgcgt 60 
ttcgcgc 67 



<210> 75 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Forward Primer 

<400> 75 

tgtccacgtc gtttacgctc 



20 



<210> 76 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Reverse Primer 
<400> 76 

tcccacttcc gttccttgtc 20 

<210> 77 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Probes endogenous /mutant 
<400> 77 

cctactccaa gtcmgtcctg ttgcatt 27 

<210> 78 
<211> 67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Chimeric oligonucleotides 
<400> 78 

gaccctgccc tgtacctacc taccagatgt tttcaucugg uaggttcagg gcagggucgc 60 
gcgtttt 67 

<210> 79 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Forward Primer 
<400> 79 

gtggtgatcc tcttccagcc t 21 

<210> 80 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Reverse Primer 
<400> 80 

ccagatgacg atgggttgc 19 

<210> 81 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> oligonucleotide Probes endogenous/mutant 



WO 01/21647 



PCT/IBOO/01470 



<400> 81 

accctgccct gwcctaccag atgac 25 

<210> 82 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Chimeric oligonucleotides 
<400> 82 

tggctgagct cttacctggt tttcattttt gaaaaccagg tcagagctca gccagcgcgt 60 
tttcgcgc 68 



<210> 83 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Forward Primer 

<400> 83 

gagctcatcg tccttgggag 

<210> 84 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Reverse Primer 
<400> 84 

agtcttctat gggccccgc 

<210> 85 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



20 



19 



<220> 

<223> oligonucleotide Probes endogenous /mutant 
<400> 85 

caccgactcg agamtggacc aaaagtc 27 

<210> 86 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Chimeric oligonucleotides 
<400> 86 

ggttgtggta tgcctggctg ccttcttttg aaggcagcca gtcataccac aaccgcgcgt 60 
tttcgcgc 68 



<210> 87 
<211> 20 
<212> DNA 



WO 01/21647 



<213> Artificial Sequence 
<220> 

<223> oligonucleotide Forward 
<400> 87 

acgcagagct catcgtcctt 

<210> 88 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Reverse 
<400> 88 

gatgcccagg aggaggaaga 

<210> 89 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Probes 
<400> 89 

caacaccata ckgaccgacg gaa 

<210> 90 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide mouse 
<400> 90 

acgcatggga atcatggc 

<210> 91 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Zinc 
<400> 91 

taggggtgag cggcgggg 

<210> 92 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Zinc 



70 



Primer 



Primer 



endogenous /mutant 



LSR specific primer 



finger nuclotides of SEQID1 



finger nuclotides of SEQID1 



PCT/IBOO/01470 



20 



20 



23 



18 



<220> 

<221> misc_feature 
<222> 10. .12 
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<223> n=a, g, c or t 
<400> 92 

gagggctggn nntaggggtg a 

<210> 93 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Zinc 
<220> 

<221> misc_f eature 

<222> 10. .11 

<223> n=a, g, c or t 

<400> 93 

agggctgggn ntaggggtga 

<210> 94 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Zinc 
<400> 94 

gtgggagccg agggctgg 

<210> 95 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Zinc 
<220> 

<221> misc_feature 
<222> 10 

<223> n=a, g, c or t 
<400> 95 

gtgggagccn agggctggg 

<210> 96 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Zinc 
<400> 96 

gcggcggccg ggtgggag 

<210> 97 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> oligonucleotide zinc finger nuclotides of SEQID1 
<400> 97 

ttggccggag cagatggg 18 



<210> 98 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> oligonucleotide Zinc finger nuclotides of SEQID1 
<220> 

<22l> misc_feature 

<222> 10. .11 

<223> n=a, g, c or t 



<400> 98 

gcagatgggn nccggaaggg 20 

<210> 99 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> oligonucleotide Zinc finger nuclotides of SEQID1 



<220> 

<221> misc_feature 

<222> 10. .12 

<223> n=a, g, c or t 

<400> 99 

agggctgggn nnaggggtga g 21 

<210> 100 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> oligonucleotide Zinc finger nuclotides of SEQID1 



<220> 

<22l> misc_feature 

<222> 10. .12 

<223> n=a # g, c or t 

<400> 100 

aggggtgagn nncggggagg g 21 

<210> 101 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide Zinc finger nuclotides of SEQID1 
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<400> 101 

aagtgggtct cggttgca 



18 



<210> 102 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide zinc finger LSR sequences 
<400> 102 

aaggtcgcct atggtgcaga c 21 

<210> 103 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide zinc finger LSR sequences 



<210> 104 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide zinc finger LSR sequences 
<400> 104 

tgggggtggg cggcgggg 18 

<210> 105 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide zinc finger LSR sequences 
<400> 105 

ccgggagtgc gcagggggta 20 

<210> 106 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide zinc finger LSR sequences 



<400> 103 

gtgggagccc gggggctgga 
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<400> 106 

gtggctgcac aaggtcgcc 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 1 7(2)(a) for the following reasons: 



1. LxJ Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Although claims 15-17 are directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the alleged 
effects of the composition. 

2. [X] claims Nos.: 1-21 (partially) 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 

3. Q Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
1 — 1 searchable claims. 



2. [_| As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
1 — 1 covers only those claims for which fees were paid, specifically claims Nos.: 



4 * LxJ No re 9 uired additional search fees were timely paid by the applicant. Consequently, this International Search Report i 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-21 (complete) 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (1 )) (July 1998) 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



1. Claims: 1-21 (complete) 

A leptin polypeptide fragment that modulates an activity of 
LSR, comprising at least 4 but not more than 50 contigous 
aminoacids of a polypeptide sequence selected from the group 
consisting of SEQ ID N0:28 (bovine leptin), SEQ ID N0:29 
(dog leptin), SEQ ID N0:30 (chicken leptin), SEQ ID N0:31 
(gorilla leptin), SEQ ID N0:32 (human leptin), SEQ ID N0:33 
(monkey leptin), SEQ ID N0:34 (mouse leptin), SEQ ID N0:35 
(sheep leptin), SEQ ID N0:36 (chimpanzee leptin), SEQ ID 
N0:37 (orangutan leptin), SEQ ID N0:38 (rat leptin), SEQ ID 
N0:39 (pig leptin); polynucleotide encoding said poypeptide, 
recombinant cell, recombinant vectors; pharmaceutical 
compositions comprising said leptin fragment; a method of 
treating an obesity-related disease or disorder by 
administering said pharmaceutical composition to an 
individual; a method of designing mimetics of a leptin 
fragment. 



2. Claims: 22-25 (complete) and 41 (partially) 

A chimeric oligonucleotide comprising at least 9 contiguous 
nucleotides from a sequence selected fromn the group 
consisting of SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:6 (human 
LSR cDNAs), SEQ ID N0:8, SEQ ID N0:10, SEQ ID N0:12 (rat LSR 
cDNAs), SEQ ID N0:14, SEQ ID N0:15, SEQ ID N0:16 (mouse LSR 
cDNAs), wherein at least 9 contigous nuclei tides comprise at 
least one amino acid codon selected from the group 
consisting of TTA, TTG, TCA, TCG, TAU, TAC, TGT, TGC, TGG, 
CAA, AGA, GAA, GAG or GGA and wherein a point mutaiton is 
present in said codon such that said codon is a stop codon; 
a method of inhibiting expression of one subunit of LSR 
comprising providing to a cell said oligonucleotide; and 
said oligonucleotide comprising 9 contigous nucleotides of 
the sequence of SEQ ID N0:1 comprising a single nucleotide 
polymorphism conssting of Al to A32; a method of treating or 
preventing obesity comprising administering to an individual 
said oligonucleotide. 



3. Claims: 26-40, 42-63 (complete) and 41 (partially) 

A zinc-finger protein comprising a polypeptide that binds 
specifically to 18 nucleotides of a sequence at least 50% 
homologous to SEQ ID N0:1, wherein said 18 nucleotides 
comprise two fragments of 9 contigous nucleotides, and 
wherein said nucleotides are separated by 0,1,2 or 3 
nucleotides; said zinc finger protein wherein said sequence 
is at least 50% homologous to sequences selected from the 
promoter or intronic regions of SEQ ID N0:1; said protein 
further comprising a transcription repressor (KRAB 
repressor) or trascription initiator (VP16) ; a 
polynucleotide encoding said zinc finger protein; a method 
of modulating the expression of at least one subunit of LSR 
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adminstering to a cell said polynucleotide; a recombinant 
vector, a method of treating or preventing obesity 
comprising providing an individual the polynucleotide of 
recombinant vector; a recombinant cell comprising said 
polynucleotide, a method for selecting a compound useful for 
the treatment or prevention of an obesity-related disorder 
comprising contacting said recombinant cell with a compound 
an detecting either a modulation of the expression or of the 
activity of LSR; said method where the compounds is a LSR 
ligand such as cytokine, lipoprotein, free fatty acids, 
Apml, Clq; said method where the detecting step comprises 
FACs analysis or requires antibodies that bind to any of the 
human, mouse or rat leptin polypeptide sequences. 



4. Claims: 64-75 (complete) 

A method for selecting genes that modulate an activity of 
LSR comprising providing a retroviral gene library to cells 
expressing LSR, contacting said cells with a LSR ligand and 
detecting a change in the activity of said LSR; said method 
where the retroviral gene library comprises a detectable 
marker; said method where the LSR ligand is selected from 
cytokine (leptin), lipoprotein, free fatty acids, Apml and 
Clq; said method wherein said detecting comprises 
fluorescent antibodies that bind specificaly to LSR, 
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Continuation of Box I .2 
Claims Nos.: 1-21 (partially) 



Present claims 1-21 relate to an extremely large number of possible of 
polypeptide fragments, namely, every polypeptide comprising at least 4 
but nor more than 50 amino acids of a polypeptide sequence selected from 
the group of SEQ ID N0:28 (bovine leptin), SEQ ID N0:29 (dog leptin), SEQ 
ID N0:30 (chicken leptin), SEQ ID N0:31 (gorilla leptin), SEQ ID N0:32 
(human leptin), SEQ ID N0:33 (monkey leptin), SEQ ID N0:34 (mouse 
leptin), SEQ ID NQ:35 (sheep leptin), SEQ ID N0:36 (chimpanzee leptin), 
SEQ ID N0:37 (orangutan leptin), SEQ ID N0:38 (rat leptin), SEQ ID N0:39 
(pig leptin). Support within the meaning of Article 6 PCT and disclosure 
within the meaning of Article 5 PCT is to be found, however, for only a 
very small proportion of the fragments claimed. In the present case, the 
claims so lack support, and the application so lacks disclosure, that a 
meaningful search over the whole of the claimed scope is impossible. 
Consequently, the search has been carried out for those parts of the 
claims which appear to be supported and disclosed, namely those parts 
relating to the fragments of the human (SEQ ID NO: 32) leptin sequence 
corresponding to SEQ ID NO: 57, 60, 62, 63, 56, 55, 54, 58, 59 and 61 
(Table 1 in example 10) and to the fragments of the mouse leptin (SEQ ID 
N0:34) sequences corresponding to SEQ ID NO: 67, 70, 72, 73, 66, 65, 64, 
68, 69, 71) (table 2 in example 10). 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EP0 policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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